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Real Party in Interest 
The Real Party in Interest is The General Hospital Corporation, to which all 
interest in the present application has been assigned by virtue of an Assignment, recorded 
on July 23, 1998 (Reel/Frame 334/0514). 

Related Appeals and Interferences 
There are no currently pending appeals or interferences related to this case, 
although a Petition to Request Withdrawal of Finality under 37 C.F.R. § 1.181 has been 
filed and Appellants bring this to the Board's attention as it may "have a bearing on the 
Board's decision in the pending appeal." 37 C.F.R. § 41.37(c)(ii). 

Status of Claims 

Claims 8, 10-13, 15, 16, 19, and 20 are currently pending. Claims 1-7, 14, 
and 21-28 are withdrawn from consideration. Claims 9, 17, 18, 29, and 30 were 
canceled. 1 

Claims 8, 10-13, 15, 16, 19, and 20 were finally rejected in a Final Office 
Action mailed on July 8, 2003, and are appealed. 

Status of Amendments 
All amendments have been entered and are reflected in the appended 

claims. 



Summary of the Invention 
Appellants' invention generally features a purified and isolated DNA 
encoding an AGE-1 polypeptide having PI-3 kinase activity, and methods of identifying 
compounds that decrease AGE-I expression or biological activity. Specifically, the 
claims on appeal are directed to the following compositions and methods: 

1 Appellants note that the status of the claims provided in the Final Office Action is incorrect. Claims 18, 29, and 
30 were canceled in Appellants' Reply filed June 26, 2002. 
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(i) purified and isolated DNA encoding an AGE-1 polypeptide containing 
SEQ ID NO:l (claim 8), and vectors and cells that include this DNA (claims 10 and 11, 
respectively); 

(ii) methods of producing a recombinant AGE-1 polypeptide using the 
vectors and cells of the invention and recombinant polypeptides produced by such 
methods (claims 12 and 13, respectively); 

(iii) screening methods for candidate compounds that require (a) a 
nematode cell expressing its endogenous AGE-1 DNA; (b) contacting the nematode cell 
with a candidate compound; and (c) measuring AGE-1 gene expression in the nematode 
cell, a decrease in AGE-1 gene expression following contact with the candidate 
compound, compared to AGE-1 gene expression in a nematode cell that is not contacted 
with the candidate compound, identifying the candidate compound as a compound that is 
capable of decreasing AGE-1 gene expression (claims 15, 19, and 20); and 

(iv) screening methods for candidate compounds that require (a) a cell 
expressing a recombinant AGE-1 polypeptide; (b) contacting the cell with a candidate 
compound; and (c) measuring the PI 3-kinase activity of the cell, a decrease in AGE-1 PI 
3 -kinase activity of the cell following contact with the candidate compound, compared to 
AGE-1 PI 3-kinase activity in a cell that is not contacted with the candidate compound, 
identifying the candidate compound as a compound that is capable of decreasing AGE-1 
PI 3-kinase activity (claims 16, 19, and 20). 

Each of these claims is described in the specification. The amino acid 
sequence of an AGE-1 polypeptide, for example, is found at Figure 3 (SEQ ID NO:l), 
and the nucleic acid sequence of an AGE-1 cDNA is found at Figure 4. Cells and vectors 
containing a nucleic acid encoding an AGE-1 polypeptide and methods of using these 
cells and vectors for the production of recombinant AGE-1 polypeptides are provided at 
pages 28-29. 

Methods for screening candidate compounds are provided in Appellants' 
specification at page 31, line 7 to page 34, line 7. In particular, at page 31, lines 15-17, 
Appellants teach methods for measuring AGE-1 expression. At page 31, lines 9-14, 
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Appellants teach that AGE-1 expression may be measured following the addition of 
antagonist molecules either to culture medium or to an animal, for example, a nematode, 
and, at page 31, lines 17-19, Appellants teach that the level of AGE-1 expression in the 
presence of a candidate molecule is compared to the level measured for the same cells in 
the absence of the candidate molecule. In addition, Appellants teach methods for 
identifying compounds that modulate AGE-1 kinase activity in vitro at page 32, lines 8- 
21. Methods for the selection of candidate compounds are provided at page 31, lines 19- 
21, and methods for purifying such compounds are provided at page 32, lines 1-7. 
Appellants teach that the usefulness of compounds that modulate AGE-1 expression can 
be confirmed by testing the compounds in animal models such as nematodes (page 33, 
lines 4 and 5). Finally, at page 33, lines 6-9, Appellants teach that selected compounds 
may be used as therapeutics to decrease the level of native AGE-1 expression and thereby 
increase the longevity of an animal, for example, a human. 

Issues on Appeal 
This appeal presents two issues: 

I. Whether the Office erred in rejecting claims 8, 10, and 1 1 as being 
anticipated by Swinburne (EMBL Accession No. Z66519, October 27, 1995); and 

II. Whether the Office erred in rejecting claims 8, 10-13, 15, 16, 19, and 20 
as being obvious over Swinburne (EMBL Accession No. Z66519, October 27, 1995) in 
view of Johnson et al. (Genetica 91:65-77, 1993). 

Arguments 

The present anticipation and obviousness rejections are based on a central 
factual error that requires reversal. As is discussed below, these rejections turn on a 
primary reference by Swinburne and the assertion that this reference disclosed 
Appellants 5 AGE-1 amino acid sequence prior to Appellants' priority application filing 
date of August 7, 1996. This assertion is incorrect. The sequence provided by 



5 



Swinburne was updated on multiple occasions from its initial submission in October 
1995. As of Appellants' filing date, Swinburne had not submitted a full-length AGE-1 
sequence, nor identified its function. This reference therefore cannot anticipate or render 
obvious Appellants' claims to the AGE-1 sequence of SEQ ID NO: 1 or methods for 
identifying AGE-1 modulatory compounds that measure changes in its gene expression 
or PI-3 kinase activity. The rejections based on Swinburne must be reversed. 

I. The Swinburne Reference Does Not Teach the Claimed Invention 

The case law is clear that, to anticipate a claim, a prior art reference must 
disclose, either expressly or inherently, all of the limitations of the claim. Kalman v. 
Kimberley-Clark Corp., 713 F.2d 760, 218 U.S.P.Q. 781 (Fed. Cir. 1983). 

As indicated above, claim 8 is directed to purified and isolated DNA 
encoding an AGE-1 polypeptide that includes the sequence of SEQ ID NO: 1, and claims 
10 and 1 1 are directed to a vector and cell, respectively, that include that purified and 
isolated AGE-1 DNA. The Office has rejected these claims under 35 U.S. C. § 102(a) as 
being anticipated by Swinburne (GenBank Accession No. Z66519), asserting that this 
GenBank submission disclosed the amino acid sequence of SEQ ID NO: 1 as of October 
27, 1995. This assertion is in error. 

While it is true that the first Swinburne sequence submission occurred in 
October 1995, 2 the nucleic acid sequence of the cosmid "B0334" that is the subject of the 
submission underwent continual updating since it first became available. Exhibit A to 
Appellants' Reply entered November 17, 2003 provides a sequence revision history, 
indicating that this submission was updated multiple times following Appellants 5 priority 
application filing date of August 7, 1996. Exhibit B to Appellants' Reply entered 
November 17, 2003 provides the version of Gene Bank Accession No. Z66519 available 
as of July 29, 1996, a deposit made just prior to the filing of Appellants' priority 
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document (U.S.S.N. 60/023,382). As shown in Exhibit B, at page 4, as of July 29, 1996, 
the polypeptide product of B0334.8 contained just seventy-six amino acids. 3 In contrast, 
the AGE-1 polypeptide sequence of SEQ ID NO:l contains one thousand one hundred 
forty-six amino acids. Thus, contrary to the Office's assertion, the cited Swinburne 
reference Z55419 [gi: 1044812] ~ available as of Appellants 5 priority application filing 
date -- clearly does not disclose SEQ ID NO:l and cannot anticipate Appellants' claims. 

In fact, contrary to the Office's assertion, the nucleic acid sequence of 
AGE-1 was not publicly available at the time Appellants' patent application was filed. 
This point is made clear in Appellants' specification at page 20, lines 1-7, where 
Appellants state: 

The C. elegans Genome Project has sequenced cosmid B0334. Analysis of 
the DNA sequence in the 4 kb region that detected the age-l(mg55) 
breakpoint revealed two putative exons that showed strong sequence 
identity with the last 88 amino acids of mammalian phosphatidylinositol 3- 
kinase (PI 3-kinase)pl 10 catalytic subunit. The region to the right of 
B0334 expected to contain the rest of age- 1 was not cloned in cosmids or 
phage by the C. elegans genome project (emphasis added). 

Appellants, and not Swinburne, were the first to obtain the age-1 nucleic acid and amino 
acid sequences, as evidenced by Appellants' specification at page 20, lines 7-17. 

We isolated genomic phage and cDNA clones extending to the right from 
B0334 and used anchored polymerase chain reaction (PCR) of reverse 
transcribed RNA to isolate and determine the sequence of the coding region 
of age-1. To confirm the splicing pattern of age-1, reverse transcription 
PCR (RTPCR) was used in conjunction with genomic sequencing of 
predicted splice junctions. The sequence predicted by cDNA clones and 
anchored PCR was further confirmed by sequencing genomic fragments 
corresponding to the predicted coding sequence. Because three 
independent cDNA clones end within 30 base pairs of each other and 
because these encode a protein coextensive with mammalian pi 10 (see 
below), we concluded that the assembled age-1 cDNA was likely to be 



2 The GenBank submission history indicates that this sequence was "first seen at NCBI on October 30, 1995," not 
October 27, 1995 as indicated by the Office. Appendix A to Appellants' Reply entered November 17, 2003. 

3 Evidence that cosmid B0334.8 encodes AGE-1 is provided by Exhibit C to Appellants' Reply entered November 
17, 2003, under the heading "Definition." 
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complete. The nucleic acid sequence of the C. elegans age-1 cDNA is 
shown in Figure 4 (emphasis added). 

Indeed, if the age-1 nucleic acid sequence was publicly available, Appellants would not 
have gone to the trouble of cloning and sequencing the gene. 

It is clear that Swinburne, as of Appellants' priority application filing date, 
failed to disclose a nucleic acid sequence encoding SEQ ID NO: 1, as required by claims 
8, 10, and 11. The anticipation rejection in this case has been maintained in error; it 
should be reversed. 

II. Swinburne and Johnson, in Combination, Do Not Suggest the 
Claimed Invention 

To establish a prima facie case of obviousness under § 103, the Examiner must 

demonstrate that the differences between the claimed invention and the prior art are such 

that the subject matter as a whole would have been obvious, at the time the invention was 

made, to a person having ordinary skill in the art. See 35 U.S.C. § 103(a) (Supp. Ill 

1997); In re Dembiczak, 175 F.3d 994, 998, 50 U.S.P.Q.2d 1614, 1616 (Fed. Cir. 1999), 

abrogated on other grounds by In re Gartside, 203 F.3d 1305, 53 U.S.P.Q.2d 1769 (Fed. 

Cir. 2000). Whether or not a claimed invention would have been obvious is a "legal 

conclusion based on underlying factual inquiries including: (1) the scope and content of 

the prior art; (2) the level of ordinary skill in the art; (3) the differences between the 

claimed invention and the prior art; and (4) objective evidence of nonobviousness." Id, 

Claims 8, 10-13, 15, 16, 19, and 20 feature compositions requiring the 

AGE-1 amino acid sequence of SEQ ID NO: 1 and methods of identifying modulatory 

compounds that depend on measurement of AGE-1 gene expression or PI-3 kinase 

activity. These claims stand rejected under 35 U.S.C. § 103 as obvious over Swinburne 

(Gene Bank Accession No. Z66519) in view of Johnson et al. (Genetica 91:65-77, 1993) 

based on the Office's assertion that: 

... it would have been obvious for an artisan of skill to express the DNA of 
Swinburne in a cell and express the protein in a cell, isolate the protein and 
study its function or practice method of identifying compounds that 
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decrease the expression of Age-1 by following the method of Johnson et al 
and routine cell culture methods. An artisan of skill would have been 
motivated to express Age-1 in a cell, isolated Age-1 protein and tested its 
activity because Swinburne identified putative functional domain. 
Additionally, an artisan would have been motivated to practice the 
screening methods for identifying compounds that decrease Age-1 activity 
because Johnson et al teaches that molecular cloning and characterization 
of Age-1 locus will provide significant insights into the molecular basis of 
senescence. (Office Action mailed July 8, 2003, page 4, first paragraph.) 

This rejection should similarly be reversed. 

As detailed above, as of Appellants' priority application filing date, 
Swinburne failed to disclose the amino acid sequence of SEQ ID NO: 1. Swinburne, as 
of that same date, also failed to disclose the nucleic acid sequence encoding SEQ ID NO: 
1; failed to identify any functional domain of AGE-1; and even failed to disclose that the 
age-1 gene was present on cosmid B0334. Swinburne therefore does not teach what the 
Office asserts and does not suggest the invention of any of claims 8, 10-13, 15, 16, 19, or 
20. 

In addition, the secondary reference, Johnson, fails entirely to remedy the 
deficiencies of Swinburne. Johnson merely describes the effects of an age-1 mutation on 
lifespan and maps age-1 to somewhere on chromosome II. Johnson does not provide the 
skilled artisan with the information, requisite motivation, or expectation of success 
required to obtain a nucleic acid sequence encoding SEQ ID NO: 1, to express it in a cell, 
or to identify any functional domains. 

Appellants were the first to genetically map, clone, and sequence age-1 as 
evidenced by Appellants' specification. Appellants carried out three-factor mapping, 
deficiency mapping, physical mapping, breakpoint analysis, and anchored polymerase 
chain reaction of reverse transcribed RNA to isolate and sequence the age-1 coding 
region (pages 19-21, Figures 2A, 2B, 2C). The age-1 amino acid and nucleic acid 
sequences first obtained by Appellants are shown in Figures 3 and 4, respectively. 
Appellants were the first to molecularly characterize mutant alleles of age-1 (page 20, 
line 24, to page 21, line 7); were the first to appreciate that age-1 encodes a 
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phosphatidylinositol 3-kinase (page 21, line 3, to page 22, line 25); and were the first to 
appreciate that a decrease in AGE-1 activity would directly increase lifespan (page 22, 
line 26, to page 23, line 23). The references cited by the Office uniformly fail to 
recognize these key insights, in addition to failing to provide the AGE-1 sequence. 

The obviousness rejection in this case has also been maintained in error, 
and should be reversed. 
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Conclusion 

Appellants respectfully request that the rejection of claims 8, 10-13, 15, 16, 
19, and 20 be reversed. The required fees for this appeal and the filing of an appeal brief 
have been previously submitted. 

If there are any additional charges, or any credits, please apply them to 
Deposit Account No. 03-2095. 

Respectfully submitted, 

Date: {& OcJr&bv Z o&f 

Clark & Elbing LLP 
101 Federal Street 
Boston, MA 021 10 
Telephone: 617-428-0200 
Facsimile: 617-428-7045 




BTaren L. Elbing, Ph 
Ret- No. 35,238 
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Claims Appendix 



8. A purified and isolated DNA which encodes an AGE-1 polypeptide, said 
polypeptide comprising the sequence of SEQ ID NO: 1 . 

10. A vector comprising the purified and isolated AGE-1 DNA of claim 8. 

1 1 . A cell comprising the purified and isolated AGE-1 DNA of claim 8. 

12. A method of producing a recombinant AGE-1 polypeptide, said method 
comprising the steps of: 

(a) providing a cell transformed with the DNA of claim 8 encoding an 
AGE-1 polypeptide, said DNA being expressed in the cell; 

(b) culturing the transformed cell under conditions for expressing the DNA; and 

(c) isolating the recombinant AGE-1 polypeptide. 

13. A recombinant AGE-1 polypeptide produced according to the method of 
claim 12. 

15. A method of identifying an AGE-1 modulatory compound that is capable of 
decreasing the expression of an AGE-1 gene, said method comprising the steps of: 

(a) providing a nematode cell expressing its endogenous AGE-1 DNA, 

(b) contacting said nematode cell with a candidate compound; and 

(c) measuring AGE-1 gene expression in said nematode cell, a decrease in AGE-1 
gene expression in said nematode cell following contact with said candidate 
compound, compared to AGE-1 gene expression in a nematode cell that is not 
contacted with said candidate compound, identifying said candidate compound as a 
compound that is capable of decreasing AGE-1 gene expression. 
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16. A method of identifying an AGE-1 modulatory compound that is capable of 
decreasing AGE-1 PI 3 -kinase activity, said method comprising the steps of: 

(a) providing a cell expressing an AGE-1 polypeptide of claim 8; 

(b) contacting the cell with a candidate compound; and 

(c) measuring the PI 3-kinase activity of said cell, a decrease in AGE-1 PI 3- 
kinase activity of said cell following contact with the candidate compound, compared to 
AGE-1 PI 3-kinase activity in a cell that is not contacted with said candidate compound, 
identifying said candidate compound as a compound that is capable of decreasing AGE-1 
PI 3-kinase activity. 

19. The method of claim 15 or 16, wherein said method is carried out in a 
nematode 

20. The method of claim 15 or 16, wherein said method involves assaying AGE-1 
PI 3-kinase activity in vitro. 
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LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
COMMENT 



CEB0334 41812 bp DNA linear INV 2.7-JUL-1996 

Caenorhabditis elegans cosmid B0334. 

Z66519 

Z66519 GI:1044812 

Gonadotrophin-releasing hormone receptor like protein; oxalyl-CoA 
decarboxylase; Phytoene synthase precursor; potassium channel 
protein; Yeast hypothetical" protein L8167.12 like protein. 
Caenorhabditis elegans 
Caenorhabditis elegans . 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Nematoda; 
Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; 
Rhabditidae; Caenorhabditis. 

1 (bases 1 to 41812) 
Swinburne, J. 

Direct Submission . 

Submitted ( 26-OCT-1995 ) Nematode Sequencing Project, Sanger Centre, 
Hinxton, Cambridge CB10 IRQ, England and Department of Genetics, 
Washington University, St. Louis, MO 63110, USA. E-mail: 
jes@sanger.ac.uk or rw@nematode.wustl.edu 

2 (bases 1 to 41812) 

Wilson, R., Ainscough,R. , Anderson, K . , Baynes,C, Berks, M. , 
Bonfield,J., Burton, J., Connell,M., Copsey,T., Cooper, J., 
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A., 
Fulton, L., Gardner, A., Green, P., Hawkins, T., Hillier,L., Jier,M., 
Johnston, L., Jones, M., Kershaw, J., Kirsten,J., Laister,N., 
Latreille,P. , Lightning, J. , Lloyd, C, McMurray,A., Mortimore, B . , 
O'Callagha^M. , Parsons, J., Percy, C, Rifken,L., Roopra,A., 
Saunders, D., Shownkeen, R. , Smaldon,N., Smith, A., Sonnhammer , E . , 
Staden,R., Sulston,J., Thierry-Mieg, J. , Thomas, K., Vaudin,M., 
Vaughan,K., Waterston, R. , Watson, A. , Weinstock, L . , 
Wilkinson-Sproat , J. and Wohldmah,P. 

2.2 Mb of contiguous nucleotide sequence from chromosome III of C. 
elegans 

Nature 368 (6466), 32-38 (1994) 
94150718 

Current sequence finishing criteria for the C. elegans genome 
sequencing consortium are that all bases are either sequenced 
unambiguously on both strands, or on a single strand with both a 
dye primer and dye terminator reaction, from distinct subclones. 
Exceptions are indicated by an explicit note. 

IMPORTANT: This sequence is NOT necessarily the entire insert of 
clone B0334. It may be shorter because we only sequence 
overlapping sections once, or longer because we arrange for a small 
overlap between neighbouring submissions. 

The true left end of clone B0334 is at 1 in this sequence. The true 
right end of clone B0334 is at 41812 in this sequence. The true 
right end of clone W02B12 is at 4181 in this sequence. Coding 



sequences below are predicted from computer analysis, using the 
program Genefinder (P. Green, ms in preparation), and other 
available information. 
FEATURES Location/Qualifiers 
source 1.. 41812 

/organism="Caenorhabditis elegans 

/mol_type="unassigned DNA" 

/db_xref="taxon: 6239" 

/chromosome="II" 

/clone="B0334" 

CDS ■ join (1692. .1838,2102. .2239,24 99.-2625,2748. .3271) 

/codon_start=l 
/product="B0334.9" 
/protein_id=" 1044820" 
/db xref="GI:1044820" 

/trans lation="MNLLDYLFNKADILEELGALNNSLLDVPPLMKTFGVIAFFVTST 
HVAGQVLYLMDTVTKNIASHTMILYDYIHEGNIPRTFYSAKCKPITCNDETSISCLPE 
FQGVGS PTTHNG VVFRTLATDKLCPWFSMMS S PI PSGKVSKKEKQHTRTPLRFNCHNE 
LRVCGGNAETHEPFEFQLITYAATLPRCTRLCRKWYVQRCGRQYGHGLIMYSYTKPG 
LIPRPLNSRLKPSTTEWFDSFSPEIRGDIFHLLKRSPISETVRRIHKDYGVHLSPKRL 
LNLRPQSWPI KDQPLVTEWMMQMIRCRHHRI I DSL " 

CDS complement (join (4176. . 4394 , 4819 .. 4 966, 5021 5163, 

5218. .5529,5685. .5735) ) 

/note="similar to potassium channel protein; cDNA EST 

yk95hl2.5 comes from this gene; cDNA EST yk95h!2.3 comes 

from this gene" 

/codon_start=l 

/product="B0334 .2" 

/protein_id=" 1089818" 

/db xref="GI: 1089818" 

/trInslation="MSLETRCRLHLWPSGVRKLLRRIAPQAIIVLILTTLMLVGAAIF 
QSIDPVLGEQSFYEVVFFEFITISTIGYGNQYPQTHASRVFSIFFSILGIPLLWTLG 
NFGKYLTKFYWKTHGWIFSERTESELVNDKDMPGIVIACLYLLTFAIGFFFIPHSGAA 
YSIDDCYFSFISFATVGFGDKVPQIDTFEKFCKVITYLVWGTILNIMLISYVTNWFTQ 
LFARQPFRGTDVEVMIGGQCITVSEITSLVAKEFHASPHQVRSILHDINGIMEDMKTE 

EDSEKS DI LVAQDL " 

CDS complement (join(6522. .6659,7308. .7435,7482. .7635) ) 
/note="cDNA EST ykl21a7.3 comes from this gene" 

/codon_start=l 

/product="B0334 .1" 

/prot ein_id= " 1044813 " 

/db xref="GI:1044813" 

/ trans lat ion= "MRQLLLLALFVSAASAFPFIGSVQAVRVTGKVTCNGQPAENIKV 
KLYEKEIVLDKLLDEKSTDGRGSFTLAGNKKELTAIDPHVNIYHKCNYNGVCYKKLKI 
KI PKSFI SEGETADRTFDIGELNLAGS FSGESTDCLN " 

CDS complement (join(10596. .11180,11230. .12189,12420. .12524, 

12857. .13062,13799. .13814) ) 

/note="similar to oxalyl-CoA decarboxylase; cDNA EST 
yk78fll.3 comes from this gene; cDNA EST ykl02gl.3 comes 
from this gene; cDNA EST ykll7b8.3 comes from this gene; 
cDNA EST yk!02gl.5 comes from this gene; cDNA EST 
yk78fll.5 comes from this gene; cDNA EST ykl34bl0.5 comes 
from this gene; cDNA EST ykll7b8.5 comes from this gene; 
cDNA EST yk27h3.5 comes from this gene; cDNA EST ykl71e8.3 
comes from this gene; cDNA EST ykl71e8.5 comes from this 
gene; cDNA EST ykl64e9.3 comes from this gene; cDNA EST 
ykl64e9.5 comes from this gene" 
/codon_start=l 
/product="B0334 . 3a" 
/protein_id=" 104 4 814 " 



/db xref="GI:1044814" 

/trans la tion= f, MSHHVSRSVSLLVRRSQFSFAPQRGLISVIDKQQCRQITDYSAK 

DSRINSRKLRNHDGSNMDGASITAAALKSQGVEYMFGVVGFPVIEVGMAAQAHGIKYI 

GCRNEQAAAYAAQAMGYLTGKPVALLWSGPGILHAIGGLANATVNCWPVVCIGGTAD 

VDLENRGAFQEWSQQESVRNSCKHVSRPTSLHTIPAHIEKAVRCAMYGRPGAVYVDLP 

GNLVLTSTEEEISFPPQVPLPAPVSIPPIAEIEKAIDTLKSAKKPLVIVGKGAAWSER 

GATQVQQFLTKSKLPWLATPGGKGVASDLHPRFIGQARSLALREADTVFLIGARFNWI 

LHFGLPPRFQKDVKVVQIDICPEEFHQNVKTEVPLLGDIGETLAELTPRLGDWTYDES 

TEWFKKLRENAEKNRAAVEKFVDDHSTPLNYYAAYQPIREFLANNDVIVINEGANTMD 

IGRTMMPSRLPKRRLDAGTFGTMGVGHGFSLAAALWARDHSPKTKVLVVQGDSAFGFS 

AMELETIARYNLPVVTVIINNSGIYRGLLPEDDKAIEGDRTLALPVLSLTAECRYEEM 

CKAFGGAGTVVRTVPEIKAALEKAFQKTDGPTVINALISTDSERKPQADHWLTRSKM" 

complement (join{10596. .11180,11230. .12189,12420. .12524, 

12857. .13062, 13145. .13193)) 

/note="cDNA EST yk31d2.5 comes from this gene; cDNA EST 

yk31d2.3 comes from this gene" 

/codon_start=l 

/product="B0334.3b" 

/protein_id=" 1041815" 

/db xref="GI: 1044815" 

/ 1 r ans 1 a t ion= "MRFLQKMFQSFCRVFGGRS VSLLVRRSQFS FAPQRGLI SVI DKQ 

QCRQITDYSAKDSRINSRKLRNHDGSNMDGASITAAALKSQGVEYMFGWGFPVIEVG 

MAAQAHG I KY I GCRNEQAAAYAAQAMG YLTGKPVALLVVSG PGI LHAI GGLANAT VNC 

WPVVCIGGTADVDLENRGAFQEWSQQESVRNSCKHVSRPTSLHTIPAHIEKAVRCAMY 

GRPGAVYVDLPGNLVLTSTEEEI SFPPQVPLPAPVS I PPIAEIEKAI DTLKSAKKPLV 

IVGKGAAWSERGATQVQQFLTKSKLPWLATPGGKGVASDLHPRFIGQARSLALREADT 

VFLIGARFNWILHFGLPPRFQKDVKWQIDICPEEFHQNVKTEVPLLGDIGETLAELT 

PRLGDWTYDESTEWFKKLRENAEKNRAAVEKFVDDHSTPLNYYAAYQPIREFLANNDV 

IVINEGANTMDIGRTMMPSRLPKRRLDAGTFGTMGVGHGFSLAAALWARDHSPKTKVL 

VVQGDSAFGFSAMELETIARYNLPVVTVIINMSGIYRGLLPEDDKAIEGDRTLALPVL 

SLTAECRYEEMCKAFGGAGTVVRTVPEI KAALEKAFQKTDGPTVINALI STDSERKPQ 

ADHWLTRSKM" 

join (16591. .16664,16715. .16815,16867. . 1704 8 , 17094 .. 17 369) 

/ no te="similar to Yeast hypothetical protein L8167.12 

like; cDNA EST CEMSG11F comes from this gene; cDNA EST 

CEESI49F comes from this gene; cDNA EST CEESI49R comes 

from this gene" 

/codon_start=l 

/product="B0334.4" 

/protein id=" 104 4816 " 

/db_xref="GI: 1044816" 

/translation="MKLWDSPNTSFPYSFDEVASAFWDRYPNSHAKHIISEDVLERQI 

TDNTIVTKKLIVKQGSSILKRVPRWISRMTDIQVVPVIEESVYDKVSKKLVTYTRNVS 

HISLFQLHERCIYKSSEDNQQHHPALLTDVLRSVTVSIDCGRMSSVYEKVLLMGFKKS 

INNTTKGLFEKLEERFGVKNVANEKMKMIKEKIIKSSSNLVTHVQCEEDV" 

join (18553. .18884,18952. .19114,19164. .19427,19857. .20360) 

/note="similar to phytoene synthase precursor" 

/codon_start=l 

/product="B0334.5" 

/protein id=" 1044817 " 

/db_xref="GI: 104 4 817" 

/translation="MRRSTQLSIIHHIRQPCQSNVPRICDAPNLETTATSTTKLFSWN 
SQWTCLRSYSNGRRLGRRFSGITSISARHFSTKSSPFLDTSSNNNISNDKKKTPSRSP 
DFSNIPGARIPKSMPKRTRFLEISKQQADEAFRSCLEMVRKHDIDSYLAILTINKRAQ 
PEIVALNALNVELASIRDKVDTRKGDASAMYRLQFWKDAISSIYGISPLPVPRQPVAI 
ALCS FAAGANS DMLLKLVETRQST I GDRQFS DI NALCEYGKST I GSLLCLQI DALARN 
SPETKVLPb4AYDVAKDLGAAYAIANMIRATHPLLARGIVLLPADVMSLNGATPDSLYK 
KKKLDEMVGMTKDLVNESKRLLIDARSPIEMVPKAVRPALAATGATTDYI IKTIEKNN 
YDI YS PHLQRRNPLLLWSLLVRKLCSKY " 

join (31563. .31968, 32014. .32087, 3364 5. .34083, 34 4 88. .34 591) 



/note="similar to Gonadotropin-releasing hormone receptor 

like protein" 

/codon_start=l 

/product="B0334.6" 

/protein_id="1044818" 

/db xref="GI:1044818" 

/translation="MDELITLEGASQSEQIIGSHDFCNFSNITHHEHDEQSISIVWWS 
NVAVLPVIALIGLACNLLNMAVLTSNKTARRIPSWNLLIALAVCDSLFLIFATLDVTP 
LSIPSLAFSTSFNHFYSRIVLYIRTLASTFYKSSRNIPKSKFLANCQ1LEKIRHVYRH 
IYYRTMDYVSLFAFNVLPIIGLLYMNSRIIFTLRRWDEDSRKYEETKLSDGLIQHDA 
HNNRTMRANAMLFAVVFMLFFCVGPQAPARILFDMYGQYHPKAILYVCLSQQLVFLNA 
SLNFCLYCWSKRYRTLMKQTLKKFLHKLEGVDHPFQINLKQTKSSSAHVTSLEDHHA 

HLIQNV" 

CDS join (36840. .36939,37888. .38078, 38131. .38355, 39373.. 39747) 

/codon_start=l 
/product="B0334.7" 
/protein_id=" 1044819 " 
/db_xref="GI: 1044819" 

/ 1 r ans la t ion= "MFFPVLVGI S AGVLSWFS VPVWIQVFCAQLGLGGLATAI VMLFE 
NRHNTLVRGRFKIEANWIRYTFYAVNG1YALLFLLPVYFRIPDPEWAKLILLKILPCP 
HPSYFQSNVFVFSMDISSVTGTAGIFLLFFLTEVFVLCIHSSYFLMSLKTHMSPTTRK 
LQQKFFIDMIFQTMTHSVWRENVKSEYRIVLRNTSIYDVAVKIVNPNESPKLERYEC 
LVLSNKYTEVAILETPSKAALLRNKPVEIYARAIYPYNRKNIRHWIEGHVSDKPHHLV 

QSLN FVADNAE YS AKT VKKV " 
CDS complement (join( 40596. .40727,41436. .41534) ) 

/note="cDNA EST CEMSC28F comes from this gene" 

/codon_start=l 

/product="B0334 .8" 

/protein_id=" 1167775 " 

/db_xref="GI: 1167775" 

/trans 1 at ion="MWNNRDLFVSLFTLMLGMELPELSTKADLDHLKKTLFCNGESKE 
EARKFFAGIYEEAFNGSWSTKTNWLFHAVKHY" 
BASE COUNT 13058 a 7729 c 7485 g 13540 t 

ORIGIN ^ ^ , ^ 

1 gatcctctcg cgaagaccat tgggctcgag tcggctcttc gctcgtcata ctgattgaga 
61 aatggcgatg tggagaatgt caagcttatg tggacttttt ttcgagaaaa aaaataacgt 
121 ggaatatatt ttttgtttga taaatttcat acacgttttt ggtcaccaaa ctttggtgag 
181 atttttttgg cgtgaatttg aaaatttcga tttattattt tgagagaaaa gtaaattttg 
241 gcaactttca ggcaatattc agaaatattc taacggatta ttttttggta gtagtgaaat 
301 caacttgcaa caaacttggc aacttaccga aacagttgaa acattgccag aaaaaggcac 
361 tatcacactt ttggaaaatg gaatataaag tttgttttag ttttcaaaga ttgctagtat 
421 aattaaagca tctgagagac cgaaaagaga actgcagagc ttgcaaagtt gcgcaaaatc 
481 tgaaaagaaa ttggtaaact ttggtagaaa tttacaaaaa tttgataaca agttggtaaa 
541 catttgccag taaattgact ttacagtact cgtgccacac tttcattgga tgtcacctat 
601 gaattaagat caattgatat tattaatttg ttgataaaag aaaaatgcat tttttgaata 
661 ttgttaaaac atatgcaaat gtaaatatac aagttcaaaa cataatgaac ataacatatt 
721 gaacagaaaa aatactataa gagagaatac tattcgctct tggagctcaa agtcttggaa 
781 aatggaacaa gattgttaaa aatgcgattt gaccccttcg tttcgtagta gacaaaactc 
841 tccttgaagt tcttgagaga aagcttgaga tttggaaagt actttgagca tctgaaaata 
901 taaaaaaatt taaataactt aatcactgaa ttgaagaata caatagttac tacaggtttc 
961 cttttacaca attttatcat tgctactctt gaaaactttg aagcggctga ctttttgtgg 
1021 attcttgttt gactatttaa attctcggat aatttttttt cttatcaaac cgatcaagtt 
1081 tgttcagcat tatgatatta tcaaaaaata caaaatttaa aaaaatgtat ttttgatcta 
1141 ccttgttgac tagactcttt cccttccttc gacgagacag ccaccgcaaa aataacaacg 
1201 tagttccaaa ggtagatcac aaattaatta tgttcacaaa ttaattaatt aattaattat 
1261 aagagctcaa catggatgga gagctgaagc tcgtcaaaca atggatcaat tgctcctgaa 
1321 tatcttttga cggagtatag tgccaaattg cctcgggagt caagactgaa gaaaaaggta 
1381 gattacctga agatctttca acacatcatt gattgtttat ctagattttc tacgtcactc 
1441 gtcacaatga aatggtcgga aatgtgctaa ttcatttgga cttcgtaacc caggaaatgc 
1501 tccggttcct gtatatactc tccacggaaa taatcgtctg cgaagcgctg ttacagatga 
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1561 gcagtgtgga gttgacaggt ttgtaattat 
1621 ttcttccgtt tgagccgaaa ctcatatacc 
1681 ctcgaaacaa gatgaatttg ctcgattatt 
174 i ttggcgctct gaacaacagc ctccttgatg 
1801 tcgcattctt cgtcacttct actcatgttg 
1861 tttccccgtg aacaacaaaa tgaaggaccg 
1921 tgcttcacac tagacgagga taacgataac 
1981 ttgatagaga gaatttcaac acttgcacga 
2041 cgaatagtga aaatttgaat taacaatttt 
2101 ggtgttgtac ctcatggata cagtgacaaa 
2161 tgactacatt cacgaaggga acattccacg 
2221 cacgtgcaat gatgaaaccg tatgttgtta 
2281 aaattacaga atccgcataa cttgacgaaa 
2341 ttgccgacca aaactgcatc tggcggtact 
2401 cggagcgaca acaatcgtca tattgagtat 
2461 aactatcaga aagaaaaaat caatatcttc 
2521 tccaaggtgt cggatctccc accacccaca 
2581 ataagctgtg cccttggttt agtatgatga 
2641 acagcgttta cactagttcg cgactctaaa 
2701 atctaattaa acttgttgag aaattccctg 
2761 caaaaaggaa aaacaacata caagaactcc 
2821 agtctgcgga ggaaatgcag aaactcacga 
2881 ggcaacattg cctcgatgta ctcgcctttg 
2941 tcgtcaatat ggacatggac tcatcatgta 
3001 gccactgaac tctcgattaa aaccttcaac 
3061 aatccgagga gacatatttc acttgctgaa 
3121 gattcataaa gattacgggg ttcatctcag 
3181 aagttggccg attaaagacc aaccacttgt 
3241 ccgacatcat cggataattg atagtttatg 
3301 tgtgcaatat tttgaatttc atttatctcc 
3361 aggttatagg tgtttgataa gtaaatgatt 
3421 aattccgcgc acgattgctc gtattttatg 
3481 gttcatcaaa atttgactga ttgattttat 
3541 tcacagaata aataagttca caacattgtt 
3601 tttagtagaa aacaggttta ggcgtttttt 
3661 gttcacgctt cttgccattt tctgcttatt 
3721 cctaaatctg aaattcccta aaaaatccga 
3781 ttcataggca atttcctgaa ctacgtttac 
3841 attatacctc agccatacct gctagtctca 
3901 actcatgtct ggataccttg tgcctacctc 
3961 tctgctgttc ccaccacatg tctgcctacg 
4021 ttgaatacca ggctgtgctg actaacaaac 
4081 ttctgtaatt ttatttttta aaaaacaccg 
4141 cagctatttc acataaatac aatgttaatc 
4201 tcagatttct ctgaatcctc ttcggtcttc 
4261 agaatcgaac ggacttgatg aggtgatgcg 
4321 tctgatacag taatacattg tcctccgatc 
4381 tgacgggcga agagctggaa gtggatatga 
4441 gtttaaacat tctaggattc gaaccaaggg 
4501 tcggcaaatt cggcaaatct tttttcaata 
4561 ggccaactcg gcaaatggcc aattcggcaa 
4 621 ggagcctgca tcaatactct aaagttctca 
4681 tgcaaaaatt tggaaatcga gaaagaaacg 
4741 tacatgtgtt ctcatttaaa taattgtcac 
4801 attccaacaa aagaatacct gagtaaacca 
4861 taaaatcgtt ccccacacca aataggtaat 
4921 ttgaggcact ttgtctccga aaccaactgt 
4981 aatttaaaaa ccataatact ctgaatatat 
5041 ggaatacgca gctcccgagt gtggaataaa 
5101 gaggcaggcg attacgatac ctggcatatc 



atattcgaga accttctctc taatttgtat 
tgaagttgcg ttatatcaac gtgggttggg 
tattcaacaa agctgatatt ttggaggaac 
taccaccgtt aatgaagact tttggagtta 
cgggacaggt ctaccacgct attgatggtc 
aaaagaatct acattcgcac agaaagagta 
ttcataatgt cggatggatc ggaatatatg 
tgcaagaaag tgagaaaaaa ggtgagtgga 
tggatacaca gataagaaat gtggtttata 
gaacatcgcc agtcatacaa tgatactgta 
aaccttctac tctgcgaagt gcaagccgat 
acgtaacaaa atcatgacat cacaaaatga 
gttgataaga cggagaaatg tccattcctc 
tttttcttgc aacttgctga ttcggcattt 
ctggtcaatt gaagcgatct gctgttagtt 
tttttcagag tatatcgtgc ttgcctgaat 
acggagttgt ctttcgaaca ctggcgacag 
gttcgcctat accttgtgag ttttatggtt 
tgttttttcg aaattgacgc acttagtgtt 
caaattctaa tttgcagcag gaaaagttag 
tctccgtttc aactgtcaca acgagctacg 
accatttgag tttcaactca taacctatgc 
cagaaaggtc gtatatgtgc agcgatgtgg 
cagctacaca aagcccggat taattccccg 
tacggaatgg tttgacagtt tctcaccgga 
aagaagtcca atcagtgaga ctgtaagaag 
tccgaagcga ctgttgaatc tgagacctca 
caccgaatgg atgatgcaga tgattcgttg 
attttttgct tataacttat gatctcaaat 
atttttcaat tttttgaata cgtgtttgat 
taaaagtaac aggttttgca aaaaaatgga 
gatttttggc aaaatgggga aaagtcccaa 
tccagataat caatctaccg gtttgtgaat 
tttcacaatt cataaatggc aacatttatt 
tttgctcttc aacactcaat tgttgcttcg 
tttctgcata ttcatggcgt ttcaaaaatt 
aaactggtgt tttatattca caaagattta 
atacatgtgg aacccactct acgtgcctac 
acaccgggcc tgcctaactc atgctgtcta 
gcgcctactt tcctcgtaca tgcctacggt 
gaaatgggcg acatttaata aagattatca 
tttgagatga cttcaagtgg tcgcccaagt 
gcatctttat taattttcca caggcaagtt 
aataattaaa gatcctgagc aaccaaaata 
atatcctcca taataccatt gatgtcgtga 
tggaattcct tggcaacaag agaagtgatc 
attacttcaa cgtcggttcc acggaatggt 
aaaatatttt tttgcagttc aattgtttgc 
gtgtgcggca aatctcaaaa tttgccgagc 
tttgccgagg acggcaaatt ttgcaaattt 
atttggcaaa tttgccgcac acccttgatt 
taataaccac attttgaaaa tttgcaaatt 
attgacatac gattgacgat taacatgaca 
tgcttttcat gaaagtattg gagatataca 
attagtcaca taactaatca gcataatatt 
cactttacag aacttttcaa atgtatcgat 
tgcaaaactg atgaagctga aatttttttt 
ctatataaac ctgaaatagc agtcafcaat 
gaagaagcca attgcaaatg ttagaaggta 
cttgtcattt actagttcgc tttcggttct 
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5161 ttcctaaaaa gagttcccat tgaggagttg ccaagttaat aagaaaatct aactgaccga 
5221 gaaaatccaa ccatgagttt tccagtagaa ttttgtcaaa tattttccaa aatttccaag 
5281 qqtcactaca agaagaggaa tccccaaaat tgagaagaat attgagaaaa ctcgtgaagc 
5341 atgagtctga ggatactgat ttccgtagcc tattgttgaa attgtgatga actcgaagaa 
5401 cacaacttca tagaaggatt gctctccgag caccggatca atgctttgaa aaattgcagc 
5461 tccaaccaac attaatgtag tgagaatcaa gacgattatg gcttgtggag cgattcgccg 
5521 qaggagtttc tgaattataa ttgcctgaca ttgtgagaaa aatttaaaaa gaatctcaga 
5581 cggggaaagt taatcttcac aataactcag tagcagaact ggatatcaca tcaccgattc 
5641 ctcgtgcgag aattgaactc gcagcggttg agtggcgggc gcacgcgcac accactcggc 
5701 cacaaatgca gtcggcagcg tgtctctaaa gacatataaa aagcgcgtat agtgtagtaa 
5761 tgttacaaaa tggtttgcaa ctttgaggat tactgtagtt ttcaactact cgttttttgc 
5821 aaagttttca ttgatttttt tatagttttc gttccgtttt tcattaaaaa cagtttttca 
5881 acaaaacacg taaggaaaac tattaaaaat aaatgaaaat ttggcaaaaa agagtagttg 
5941 gaaattacag taatcctcaa aggcgcacat ccttttgcat ttttgcaaaa tcattttcct 
6001 gtcgagacct ggtaccgtat ttttaggaaa aactcgcaaa ctttcttgag gtttccgaaa 
6061 aataaattaa tgcatcttac ctcaagactg ctcttctgct tcattttgat gaaaaatcga 
6121 gacaaaaata gtggaaaaca gctgtgaaaa accggaaaat actgtgccag atgtcgtcgg 
6181 gcaggtcact tcaaaaacgg aaaattgtaa aaaaaatcga tgagtgatcg aaaaagaaga 
6241 gagacgtaaa gaagaggtgt aaaaacaaac tggcccgtct ttgtcagtag acgaaaaagt 
6301 attcaaagtg ccccatgaat ttacgacact cgttatagca accaaatgtc tgcgcgtacc 
6361 gctctttttt tgtatatcac tcgagaagaa gaaggaaaag agacaaaact gtgtttattg 
6421 tgggggaaaa aagaatgtga aaaatgagaa atgtgtcgac aaattgattt gatgtttagt 
6481 ttcagtgaga gaaaacgaac aaaaaaaaat ataagaagga tctagttgag gcaatcggtg 
6541 gattctccgg agaatgatcc ggccaaattg agctctccga tatcgaatgt tctatcggca 
6601 gtctctccct cgctgatgaa gctctttgga atcttgatct tcagcttctt gtagcaaacc 
6661 ttaaaatata aacgattatc gaatgttcga ggagaacgaa gtgagaatcc tagacagtga 
6721 aaaaccctat- tgtctattcc tagcgccaca tttccgatca atccatcata tgtttataaa 
6781 tgatgtcact tttcaaatgt aactgtttcg ttgcaaaagt tacataaaaa ctcctagcaa 
6841 ttcaggtttt cttctgagat agcggtagtt aaagtttttg gctataggat gatgacgtca 
6901 cgttcttcca gcaatccgct agttggaaaa aaaatgtttt taatcttgtt tagaaagttt 
6961 taatattctg aaataccaga aaatacaaat ttaaatatat cgaaagactg aacaaatcta 
7021 gctttttaaa atataccagt tctctttttt agagggaatt caaacatgga aattcgcaat 
7081 gttacaaatt agacttttta atcttgctcg attttttcat attcgtattt ggttttctga 
7141 acttacaata tcgagaatgt taaagctaaa actcaattca cgctactttg cctgaatcaa 
7201 aattggaaaa taaaactgtt ctagactaca actgttctag gaatggcgaa atttccaaaa 
7261 gtagctcact ctctacataa aaaagtcaat taaatgtata ttcttacccc attatagtta 
7321 cacttgtggt agatgttaac atgtgggtcg atggctgtta gctctttctt gtttccggca 
7381 agtgtgaaag atcctctacc gtcagttgac ttctcgtcga ggagcttgtc aagaactgaa 
7441 acatgtttaa cttctgggtg ccgagcagtc aaaaacctta ccaatttctt tctcgtagag 
7501 cttcacttta atattctcag ctggttgtcc attacaggta acctttccag ttacacgaac 
7561 agcttgaacg gacccaataa atggaaaagc ggaggcagcg gatacaaata aggcgagaag 
7621 cagaagttga cgcattctga atttttatgg aattatgggt gtaatggagg tatttataat 
7681 aataatttta taataacgaa aatataattc taaagataga gaggaagtga tgtattatta 
7741 ggggaatttc aatgagtcat acccccaatc gtaccaagtg atgtgaatga aaaaagaaaa 
7801 aaaagatggt gaagcagttg aattcagtgt tttctttctt cttctactag tcggtaccca 
7861 cccacttcat ctaaacaatt gggccttttt gatatcaaaa ctaacaggga aatagtgtaa 
7921 aaatcacgga gacgcagaga aatatctagt tacactgtgt tgtttttggc aaggtcagta 
7981 agtgtttcat ggtcagtttt ttggcgatcg ggaagtcaaa tcgttttgat aatacttgga 
8041 attgataggc ttcggaaagg attacgaaaa aaaaacaaaa cgtttcggat atggggttac 
8101 gtggcgaaat ggttgaggac actaaacaat tatccagatc tagtttgctg taattgatgg 
8161 gcaattattg gacagagagg tcggaaagtt gaaataatca aaatatgaat gtttaactat 
8221 aaacaagtaa agtcacgaaa acgtgtgaca gacggcaata ctaaaccttc attttcacat 
8281 aaataagcaa acgtagtttg aattttgaat ttaaatttat ttcactttag aaacaatgga 
8341 aaaaaaacca acatacatag aataatgtaa ctaaagaaat tgaaatctgc ctattttgca 
8401 tataaatggt cgttttggtt aattaataaa atacgcactg aaaggcatga ctgacagtgt 
8461 atgttttctg ttactacttg aagcttttgg cagcaactgg gaattcgtta atatattctg 
8521 aaagagctaa aaatttgcaa caatgcttct gaaagaaatc tcaaaataag ttactttgaa 
8581 gtaagttagg agttttaaag ttttgaaact ctgcctattt tgcctataaa tggacgttta 
8641 attcatttat aaagtatgca ctaaaaggta tcactgagag tgtatgtttt ctgtaactac 
8701 ttgaagattt tgattgcaac tggaaagttg ttaatgtttt ctgacagagc taaaaatttg 



8761 caacaatttt tctgaaagaa atctttaaaa 

8821 aaagttctga aaatctacct attttgcgtt 

8881 atgtaatcat atgtgtctag tacatatagt 

8941 taaggcactt tttgtttttt gtttaatatt 

9001 caagatcctt ttgttccagt ataacaatct 

9061 atgttcatta tgattgtaaa gtctaagatt 

9121 caaaacccgg tatttgaagt ttacacatct 

9181 tcttcttttt ctgatttttc ttcattaaat 

9241 cctttgattc tattcgcatt atttattggg 

9301 atgcaggaat tgaagacaaa agtagatcca 

9361 ccgagagcaa gaattattgc aacggatttc 

9421 ccatttaagg taagattttg atattttgaa 

9481 tttgatttta tcaaaattac gtcaactgac 

9541 ttgtaatgaa ccatttttcg aacaaactga 

9601 aagtagatcc attggtgttc aagtttcagc 

9661 catacttaag ctttctttta gttttgtgaa 

9721 ataatttgaa aattgacggt cgtgtcgaga 

9781 tttttcaaaa tcagattatt caattccttt 

9841 cagttgtcag aacttttaga ggtttttgac 

9901 acgaaacttt gcagctaccg taaccacgag 

9961 tcgctcaatt ttttctattt ataatgcaaa 

10021 aattcatgtg aaagaaatac agaaaaaatc 

10081 agcagcaaag tttctgtcgc tctagcaaga 

10141 attacaaaat tttggaaact ggataatacg 

10.201 taaccaacaa tttaaaggta tatgccgcct 

10261 gccaagaaga cttcaggaga tggaagctcg 

10321 gtaaagaagt gaaaagttct gttcgatcgc 

10381 atgagcatgt tggagatttc ttcccatcca 

10441 aattttgtaa ctttgaatta attatgtgta 

10501 actagggttt attcaaatgt gatcacagtt 

10561 ggtaaatgca gtgaaatata caatttgaga 

10621 tgatcagctt gtggttttct ctccgaatct 

10681 ccgtcggttt tttggaatgc cttctcgaga 

10741 acagttccag ctcctccgaa agctttgcac 

10801 gagagaactg gaagagcaag agtacggtct 

10861 agtccacgat agattccaga gttgttaatg 

10921 gcaattgtct cgagttccat ggcagagaat 

10981 accttagtct tcggactgtg atctcttgcc 

11041 ccgactccca ttgttccaaa ggttccggcg 

11101 atcatggttc ggccgatatc cattgtgttg 

11161 ttggcgagga attcacgaat ctgaaaattc 

11221 tccacttact ggttgatatg ctgcgtagta 

11281 cttctcaact gcggctctat tcttctcagc 

11341 tgactcgtcg taggtccagt ccccaagtct 
11401 atctcccaaa agtggaacct. cagtcttcac 

11461 gatctgaaca accttcacat ctttttggaa 

11521 gttgaatcgg gctccgatca ggaatacggt 

11581 ttgtccgatg aatcttgggt gaagatctga 

11641 ccatggaagc ttggatttgg tgaggaactg 

11701 cgcggctccc tttccaacaa taaccagagg 

11761 tttttcaatt tcagcaattg gtggaatgga 

11821 gaagctgatt tcttcttcag tgcttgttaa 

11881 agctccaggt cttccgtaca tcgcacatct 

11941 gtgaagactt gtcggacggc taacatgttt 

12001 ccattcctgg aatgctccac gattctcgag 

12061 tacaggccag cagttgactg tggcgttggc 

12121 tccagagact acaagaagag caactggctt 

12181 ataggcggcc tagacaggta tttgttttta 

12241 cgaaaaggtg ggaaattcag aaatatcacc 

12301 attattagtt gggtttttca tttttcaacg 



aagttacatt gaagtaagtt aggaattttc 
acaaatttac tgattcatgt ctgaacaata 
tcggtacatc ttctaatact tttatcttgc 
atgaattttt agtactctcg ataaacggta 
ttttcaaaac ttaaacgctc taatgaagat 
gttatctcgt gcgcctatta cagcgagaat 
cttctcaggg tacataacca gctttttgca 
ctcaaatatc tcgagatgag gtcgttgtta 
tgcttcactc aacatggagg aatcgctaga 
ataagaagct tcgatcttcc ggatgaagct 
ttgacaagag gcaatagaaa aatgaaaaag 
aaacactact acgcttatat ctctgttctc 
aaaatacgta aaaagtattt atctacattt 
agagaaccga gatataagcc gtcaaagtag 
attcggtatt ttttgaaatg caaaatatcg 
ctgggaaatc gcacgctctg tttggctgct 
ccggctaccg tatcattgcg ttatacattt 
gttctctaaa ttaattaact ggtcattacc 
acaaaaatta tgttcccggt ctcggcacga 
tattttgcac tcttctctct tatataagct 
aacttcaatt ttttggaaag tgttttaaaa 
gctgaaaaaa gtgctgcctg tggttactgt 
cctggttcag tatgtatttg tgccttaaaa 
acactctaat attatgtggc aatcagacag 
tctgaaaatt gaattttcag gacgattccc 
acgagctcac agatcgagat atgagaagca 
cattcacaaa ggatagtgga agtgaagaag 
aaagagttta attttgtaaa ctcagaattt 
aatatatgtt ctaaaataaa acactcaggg 
taaaaaatac aaatcaagga aataattaat 
cttatttaca ttttcgagcg agtcaaccaa 
gttgagatga gagcattgat aacagttggt 
gctgccttga tttctggaac agttctaaca 
atttcttcat aacgacattc agcagttagc 
ccttcgattg ccttatcatc ttctggcagg 
ataacagtga cgactggaag attgtatcga 
ccgaaagcac tgtctccttg aaccacgaga 
cagagtgcgg cggcgaggga gaatccatga 
tcgaggcgac gcttcgggag acgggaaggc 
gctccttcgt tgatcacaat gacgtcattg 
gaatttgaat tatatatatt tattcaaaat 
attcagtgga gttgagtgat catcgacaaa 
gttctccctg agcttcttga accactcggt 
tggtgtcaac tcggccagcg tctctccaat 
gttctgatgg aactcttctg gacaaatgtc 
tcttggtgga agtccgaaat gaagaatcca 
atcagcctct ctcaatgcca aggatcttgc 
agcaactcct tttccacctg gagttgcgag 
ctgaacttgt gtagctccac gctcggacca 
tttctttgcg ctcttcaagg tgtcaatagc 
aactggagct ggaagtggga cttgtggtgg 
cacaagattt ccgggaagat caacataaac 
cactgccttc tcaatatgag caggaattgt 
gcatgagttt ctgacagact cttgttggct 
atcgacatcg gcggttcctc caatacagac 
aagtcctcca atagcgtgaa ggattccggg 
tccggtgaga tatcccatag cttgagcagc 
aaggaatttt atattttatt tcaatgagac 
aaaaaaactt tctgaaaacg gtatttttca 
aatatttttt gttttctttt gtgatatttc 



f 



12361 tgaatttcca acgttttcgg gcattttcga attccccgcg tcttccaaaa ttcacgaacc 
12421 gcttgctcat ttctacatcc gatgtatttg attccatgtg cttgtgctgc "ttcctact 
12481 tcaataaccg ggaatccgac gactccgaac atgtactcga caccctgaaa tctatttttt 
12541 ttgcttattc tgtaagtatt tcttgtcatc aataatcact gataagagtg aattatggtc 
12601 ttatttctga taatctatga gcggaggtga aagtcgacaa tttttcggga aaaattgcca 
12661 cqtggagggt atcagtgaag tttttgtaaa agttataaaa agtaaactac atttgagaaa 
12721 aattaatttt agagcaaaat tttaacattt ttgcaaaaaa gttttttttg tctggaaaaa 
12781 ccatccaaaa tcccttttca gaaaaactcc attttttaaa caaattaatt tttttgattt 
12841 tttaaaatga acaaacctga gatttaaggg ccgccgctgt aatcgatgca ccatccatat 
12901 ttgatccgtc gtgattccgt agttttctcg aattgatgcg actgtctttg gccgaatagt 
12961 cggtgatctg ccggcattgt tgcttatcga tcacactgat aagaccacgt tgtggtgcga 
13021 aagagaattg cgaccttcgc accagtagag acactgatct acctgtgaaa gatatattta 
13081 ttagagagtt tcgtagtttt ggaagctttt atttagatga attgaaattt tgattccaga 
13141 aaacccccaa aaactctgca aaaagactga aacatttttt ggagaaatct catttttggt 
13201 ttactgattc aaaattgaaa cacttttcta tgttttttcg gtcgaaattt tgaataaata 
13261 tattcgggta gatatttttt ataatttgag tttaaaaaat aagaaaattt tcggtttggg 
13321 ttactgtaca ttttagattt taattatttt ttcgctcccg agaatattac cgaattattg 
13381 aacaaaaatt ctttaattaa aattcgtgac tgggtaatat aacctagacg cgaaattttg 
13441 cgattttcgt gacaactttt tgtaaaatac aagagggtgt gcacgcatgc gcctttaaag 
13501 agtactgtaa tttcaaattt ttgtttaaaa tttctgctga ctttctatct acttttaaaa 
13561 tattttcgat aataagaaat ataactttat cggaaaactg agaagaactg tgcaaaattg 
13621 atgaaaattt taaaattcgg gagaaacgaa attttaatac tacagtactc tttaaagacg 
13681 cacatctctt tgcatttagc agaaatttgt cgtgtcgaga ccggatacag taactttgaa 
13741 gcaatgctag caagacttcg caaaattcca gaaaatattg tttaagtcaa taatttactc 
13801 acgtgatgtg acattggcaa gagaagaagc tataaaccgt gtggatgaca tctgtaaaac 
13861 tttttcgttc t.gattgaatg aaaaagcgga agcaaagaag tcaataaaac aaggttctga 
13921 tttgaatgat ctctgtgact cgtttctttt actgtctgcg tctatgtatt caacgaattc 
13981 agtgcaattc aacagcaaag attacaatga tgactattat ttttttgaga aaaccttctt 
14041 cgtgagacgt catcagattc ataaaggtct cggaaatagc ttgatgagta aacatttgat 
14101 ctcaagttca aagaactatt aattttgaaa ccggagatga ttcattgaca aggaaagcaa 
14161 caaaaaaaac taccagtaaa gtgtaattca aggaagaggt aataggcaag atctacagga 
14221 aaagaatgga aaattagtaa aagtgatttc tagatatgaa ctaaaaatgt acaaaaaaga 
14281 gggaaaaata ttgattgtac aaacaaaaaa agagataaca aattgaacgg gttgggttga 
14341 ccttcgagtc attagaagct tgattctgat aaggttgttg agtcgatttt tgcaggagat 
14401 tctgaaattt aaaagtaaag ctttttaata agtttatatt gttttacaca aaaatcaata 
14461 aaggacgaag cggtgaacat tttagttttc tagtttttta gatgttaatt tatgattttt 
14521 aaatttgaat ttttcgcaga atttatgtta tgggcaactg gaatccatct gaatttcctg 
14581 ctcaaaacat taccgttaac tttctcctca tcctcatctt cactcgattc tccgagtaag 
14 641 ctcagatcgt cttctagcga cgattccacc ggagctgagt cttcttccat cttcatcgac 
14701 tcctctctct cgttgtcgct ctttttctcg ttcccatttt cgcttccact cacgacgtct 
14761 tctcgctccg atgttggata acttatcatt tcgggtctga aataatattc gttatttcac 
14821 acaatttaac aggaaaaaca ccgcaatcat actttttatt tcttgcctgt tctaatcttt 
14881 tcatctcttt tgctgttata agcagttggg gagctggctg ctgtggaggt tctccatcca 
14941 gccaacccgc gatttttcca atactggaac tacagaagtt gagccctgaa aaatattagt 
15001 ttttgttatt taactattaa attaaaattt cttacaaatc ggttcataac taaagaatgg 
15061 agttcgaata gctccaagca ttagaacaag gaagattgtc atcacgattt gagctatcaa 
15121 tggagcaccg tcatatagat ttctggaaaa attaatttac agcttttttt atgaattcgt 
15181 caaaaaatac ctgaagaata aattaaagtg cttcgcagaa ccagaaaaaa acgcaaagaa 
15241 tcctccggaa atagtctcag aaaacacttc gatgatagaa atctcgtgga aaatcgatat 
15301 ggtttgactc tgaaatttac aacatttaaa ataatttaac ccaacaaaga agaaataaaa 
15361 caaaaactaa cctcaatgta ctttaaacat tttgattttt gccgatattg gaaaggcgat 
15421 gcgaggattt caaccattct tgagagtaat ccttctgtag cacaagaatc atgagcatgc 
15481 tcttggaatt gtgcaaatct tcgtgattct gcttcttgat attttttgct ataaccagaa 
15541 tacatggaaa caagaagagc agttgagaga acgagaatcc aaaagtttcg tggccgaacg 
15601 attgatcgaa gaacaacaaa agcagccgcg ggcaaaatga agatgttcag tgtgaaaatg 
15661 aaaggttgga gagtcacaaa tgtatcctgc caagtacgat ctggagctgg aggcttcacg 
15721 ctgaaaatgt ttgacaacgc ggatcggacg gcttctctct cttcagccaa atctgcttca 
15781 tcatttttta aatacttcga aagaaccgac atgctgtgcc gagatatttt aattctgcaa 
15841 ttcaaagctt aaaatatatc ggaactagaa gaaaatatac tcactgtaca tcttttttaa 
15901 ttgaaccttc caaatgatag tcaattttgg tttcgtggag aagttgtcga agaatcagac 



15961 gaagactttg atcaggctga aactattgaa tgggttattt acggatttct ttaaagcttt 
16021 tagtttacct tctgaacttg atttcgaaca gatagaagtc gttttgacgc tggcgggctt 
16081 tctccagaaQ ttggatcatt cggatctttc cacacatttt catgtgtaaa ttcttcgtca 
16141 atgtcttg&g ttgcgaaaag ttgcgggacg aggagtagaa gtgtgaccca tattattttc 
16201 atctgaaa^a agcggaaagt tgaaagagaa aagatgttta cagcacgctc agcaactgac 
16261 aggcgagttc ataagataac atagttgttt gatcaataaa atgttgtgtt tttccgagtt 
16321 ttcagata&t tcgcatttaa acttcacttt ttatattcag aaaattgtaa aatcg<aaaca 
16381 tttcaagaat tgcgaaacaa gttttcagaa ggcaaattaa taagtataat acggttgaca 
16441 attggtgtat gcagaattgt cgtgaatttc gatgcctgcg tacgcgagaa atttgcattc 
16501 attattttct tttccacagt tttccacact tttctcatcg atttttgcaa taatttcatc 
16561 aaattttctt gtttttcagt aaacttcaag atgaagcttt gggattctcc gaatacctcg 
16621 ttcccttatt catttgatga ggttgccagt gcgttttggg acaggtgagg caggaaatct 
16681 cggaaaat<gg taatttcaat ttctaaaccc tcagataccc aaattctcat gcaaaacaca 
16741 tcatttcg<? a ggatgtactc gaacgccaga tcactgacaa cactattgtc actaaaaaat 
16801 tgattgtcaa gcaaggtatt tagctgatat tacaagagcc agtaaattat tcgcataatt 
16861 tttcaggaag ctcgattttg aaacgtgttc ctcgttggat ttcgagaatg acagatattc 
16921 aagttgttcc tgtcattgaa gaaagtgttt atgataaagt atcgaaaaaa ctcgtaacat 
16981 acactcgaaa tgtttcacac atttctctat ttcaactcca tgaacgatgc atttacaaat 
17041 catccgaggt ttatcattgt tatttcaaat ctctatttac tgttattttt caggataatc 
17101 agcagcatca cccagccctt ctcactgacg tactgcgaag tgtcacagtt tcaatcgatt 
17161 gtggtcgaat gagctcagtt tacgagaaag tcctactaat gggcttcaaa aaatcgatca 
17221 ataacacaac gaaagggttg tttgagaagc tagaagagcg atttggagtc aaaaacgttg 
17281 ctaatgagaa aatgaaaatg atcaaggaaa agattatcaa gtcatcgtca aatctcgtca 
17341 ctcatgtcca atgcgaagag gatgtetagt cagccatact gaatataaat ctcaaacatt 
17401 cactttttcc cgttattttc atttatgtcc ccccaaaaaa atctcaattc tcatagaatt 
17461 actgaatctt ctgcacattc tttgcattga actgagacca ttatttgttt gttatttggc 
17521 tctcaatctc tcctgataaa tgtacatatt caatttattt tttgaatttt ttttggctat 
17581 tttctggtta tttaccagct attgttatct ttcttgacct tttttgtctg aattacatta 
17641 taatctatca taatctccaa tatttaggct cttctcaata aggcttttca aaactccttc 
17701 aaaaagcttc ttggtcttcg atcaacgata tggcgagtac tatttaccgg tgagtgacct 
17761 cccaaaagtt tctggaacac ggcagttgtt ccacttgaat cagcaaaaaa taaccatcaa 
17821 tcacaagaa^ aagagacata gatatcaaaa gaaaatcccc tgttgcagcc tgatcacaat 
17881 cgtctcgctt ctctcattgc tccattgtgc ctattccgcc gcccaacacc gtttctacct 
17941 gcgtctcact gagcaaccgt tcatcaatct cccatccgac gtcgtcgctc agacagttat 
18001 ctccttgatt gcactgatct acggaacttc cttcctggcc aatccattcc ttttcataaa 
18061 gaaggatcaa gtgagttgga ggagagattc atgaaaattg tttaaaccaa aaacgaggaa 
18121 aacccgattt tcgtaatgag aatatctgaa aattttcaaa aaaaattatg gctgtgcaaa 
18181 gtatttgaa 3 aataactatt ttcagctaaa atctcaaatt ttgccagttt agaagtgtta 
18241 aaaaattccg tcagtggcgg cactgacaat ttggcaaaat ttgagatttt agctataata 
18301 tgcgcctttt tccaatactt tgaaccgcca tgattttttt tgagaaattt tcaaaggtcc 
18361 tcatcacgaa aattggcagt tttttggtta gtttgggtct aaaaaagcaa ttttttctct 
18421 ttttccaatt tcgctactac gcctttaaaa aacttaaaac aaaaattagt tcggagcttt 
18481 aaaatcaatt ttcctgtttt ttggcatttt caatttactt caatttccca atttccagca 
18541 ccacgaccga tcatgcgacg aagcactcaa ttgtccatca ttcatcacat tcgacagccg 
18601 tgccaaagca atgtccccag aatttgcgat gctccgaacc ttgaaacaac agcaacgtca 
18661 acgacaaaac tcttctcgtg gaactcacag tggacgtgtc tccgaagtta ttcaaacgga 
18721 agaagactcg gccgacgatt ctcaggaata acttcaatct ctgctcgtca cttctctaca 
18781 aaatcctctc cattcctcga cacctcatct aataataaca tttccaatga taagaagaaa 
18841 acacctagtc gttcgccaga ctttagcaat attccgggtg cacggtaata atttgttttg 
18901 tcttttatgt ctttcaacta tttaaaaata taaatctttt ccattttata gtatcccaaa 
18961 atcgatgcca aaaaggaccc gattcctaga aatcagtaag cagcaggccg acgaggcatt 
19021 cagaagttgt cttgaaatgg ttcgaaaaca cgatatcgac agttatctag ccattttgac 
19081 tatcaataaa cgagcacagc cagaaattgt cgcggtttgt agaaattcga cccaaaaata 
19141 gttttaaaat cgggcaaatt cagcttaacg cactgaacgt cgagttggct tcaattcgtg 
19201 acaaagtg9 a cacaagaaaa ggagacgctt cagcaatgta tcgccttcaa ttctggaagg 
19261 atgctatttc atcaatctac ggaatatctc cacttcctgt accacgtcaa cctgttgcta 
19321 ttgctctgtQ ctctttcgcc gctggtgcca actctgacat gcttctgaaa ctggttgaaa 
19381 ctcgccagtc aacaattggt gataggcagt tttctgatat taatgctgtg agttttaata 
19441 ttgaaaaatt tcgaaaaaat atgtccgaaa aatttaacat attcccaatc gcaaatattt 
19501 tgtgattttc ttaaattcaa aagatcgtgc gcctttaagg attactgtag ctcccatttc 



19561 ctcatcgttg agtgattttt agaagctaca gtactcttta aaggcacaca ctcttccgag 
19621 tttaagaaaa tcgttcgctt cgagaccggt tacggtgttt ttggagcaaa aaccagaaac 
19681 ttttgcatct gagtaatatg tttcttttgc tctatgaatc aaagaaaaaa atcattattt 
19741 ttcaatgcgg aatttagatg agaaaaacca attttggttt tttttaatct aaaatgtaga 
19801 taaaatagac aaatttcttg aaaaaaaaaa attaaaatta ctaaatattt ttccagttat 
19861 gcgaatatgg caaatcaaca atcgggtctc ttctgtgcct acaaatcgat gctcttgctc 
19921 gtaactctcc agaaactaaa gtactcccaa tggcctacga tgttgcaaaa gatctcggag 
19981 ccgcgtatgc tatcgccaat atgattcgtg ccacacatcc actactggct cgtggaatcg 
20041 ttcttctacc agctgatgtg atgtctttaa atggtgccac accggatagt ctttataaaa 
20101 agaagaagct cgatgagatg gttggaatga caaaagatct ggtcaacgag tcaaaacgat 
20161 tacttattga tgctcgaagt ccgatagaaa tggtaccgaa agctgtgaga cctgcgcttg 
20221 cagccactgg agccacaact gactatatca ttaaaacaat cgaaaagaat aattatgaca 
20281 tctactctcc tcatcttcaa cgtcgaaatc cacttcttct ctggtcactt ttggttcgaa 
20341 aactgtgctc aaaatattaa aatgttatat ttttcgtgtt ttttcttcat cttcatcttc 
20401 tttttttttc tttcctcttt ccctttcccc ttctccccat ttcgtgttct ctctctctca 
204 61 ttcgaatagt tatatttata tttatggcat caagtaaaga gcttacaatc agtgtggagt 
20521 actacgaaga ctagtaaacg gataactgaa tcataacaag gagcccatcg taaagaagag 
20581 cgggaagaag aagaatgtgg cgaattagaa tgtgcaatac gagaatactc gaagatgaac 
20641 cctcttttga attgttgttg ttgttgttat tctggaagct attttttctc ttgagagaag 
20701 agaacatcct caaaaaaaag atataacaac aacaatttga agggggaagc catttttaca 
20761 aatttagcgt taagctggaa cacttctctc gaatattatt ttaaaactac atttagacta 
20821 catttactgc tcttgcgaat ttaatttaat attccgatta acctgatgcg aaatttttga 
20881 acaaaaacac ggaaccgaag aggtgtgtgc ctttaaagag tactgtagtt acaaacttgt 
20941 gctgttgcgg aatttttatc gacttttcat aacgatttgt caaaatgtat ctgttttaag 
21001 tgatttatgt tagtttttaa aaaaataaat taaaattttt taattaaata aaatttaaac 
21061 aaaaaactat gtaaaatcag ttaaaattcc acaacagcaa atgattgaaa ctacagtatt 
21121 ctttaaaggc gcacgctttt tcgctttttt caaataaaaa tttatcgtgt cgagccagat 
21181 gccgtatttt gcatttaaca gtaagtacac aatcaagagc gcccttacgg aaaacttcac 
21241 ttaatattga tatataatta cataaacatt atagtaaacg tgcttacctt tgcgatacgg 
21301 ctgttatcag ttggaaattg agtatgaccg gcgaaaatag caatcattat gagaaattca 
21361 caattcaatc agcttttata cgcagaattt ccattttaga atgagaaatt tctatcaagt 
21421 gtaaatcaaa gaaaatttaa agttagcaat ccagttttga caccgtttat tcgggtgttc 
21481 atcaatttaa acatgttttg aattttagac atgttaaaag gaatctatca ggtctaataa 
21541 ttttagctga atatttccga agagaattag agcagcctgg aaaaaatcta aattttttaa 
21601 caaaaaatat tactcggatg cgaaatgctg caatttatga tggaaaaata cggtaccagg 
21661 tctcgacacg gccgcataag tttatatgaa aatttttatg cgccttcgaa gagtacagta 
21721 gttttgcgca gcttttcgaa ttattttcac cccctctgct cattttgcaa attctgaatg 
21781 cattctgtac atactttaca aaagtttccg ttcaacttct gaaaatttgt ttgctctttc 
21841 cggaggcctc cgccaatctg tcaactccaa tttgctgaaa atgaactcaa ttcgttgttt 
21901 acaagaaaaa tgctggaaga agagaacatg cagtattaaa gatgtttatg tttttgaact 
21961 cgcatcctga ttttttttgt gtttttgtgt ctaaagttct gggatcagaa ttttagattg 
22021 ttagtgttag ttttgttgtt ctttttttgc aaagaattgc atttttccgg gcgttactcg 
22081 cttcttcttc ttgtgattct gtatgtttag gcaaaaagtt cggtaaagtt tttttgaaat 
22141 tgaaggctct gaagagagat tttggtgaca gtgttctcgg tttaaagctt ctatgcaaag 
22201 tgcatagtct tcctagaaaa ctgtccagac caccgattca tttttgtaaa accctgccca 
22261 tcatacatat tgttttcacc tgttctccaa tatttttcaa ctctcctctc actttcaaac 
22321 tgtcacgaaa tgctgaaaga gaagctctca atacctctgt ctgtctcact tgatcgcgag 
22381 accccctctc tgtatttgta cagtaaccac tgcgtctcat cgatccacgt actccatatt 
22441 attgtctcct gcatcaaaca cacctagttg cgcatgtttt tgcaggagcc gctcccctcc 
22501 tcattcaatg caatccatca atttcccgtc cttgccaaat acgacattta ttataagcaa 
22561 ctttttttat tggcctttta aatttcaaaa gttttttcaa ccctaatttt ttgatctttg 
22621 ctagatctgt tgtttgatga attttgagga aaggttgata tcaagaattg agctcaggct 
22681 cgaaaatcgc ctatttgtgt gcctaagcca aatccaaagc ctaagcgtaa gcctgagcct 
22741 aagccggata ctaagcctaa gcctgaaaac gattcatttc tcagaaggat tctcgaaaat 
22801 tggttatcat gtttcatttt gtatatccaa tgcagcttct aatacaatca tcactaatgt 
22861 atcggaacgg atcgggaaaa ccgatagtaa tcggcaaatg ccattagtct atccaaattg 
22921 agacaaaatg atgaagcatc aaggacgcgc actcctcctt atctcttatc aaccgatttg 
22981 ccacgtcaga ttctgtctaa ctttcacctt ttttctcgaa tctccttctt gccgaaagga 
23041 tctatccgtt ccgtgactcc agtctttcat gcactttctt ctttataaga aagtgatagt 
23101 tgcggactta tcactgctta tcactattga agccacccaa agaactagat atcggaaaca 



23161 tgttagccgg tacgatgatt cgagacttga 
23221 gatggattac gcaaggcgaa aacataaaac 
23281 tcagataaag ttcaacattt cgatttttaa 
23341 gagatagcct ttttgtttta gaatttgttt 
23401 ctgaacccat ttaagtttac tgtagaaaat 
23461 gtattcctgg gactcactta catttatata 
23521 tggaaaaaat tctacgataa gcacataatt 
23581 atctatggta tctaagcggc ctgcggctgc 
23641 cacacaggaa acatttatca tttcccagtg 
23701 gtgtgtatat ctacaagatg caagcgtcaa 
23761 agaaaacgtg cctacctatc atgaaagctt 
23821 caaaaattag ccgagatata attgtttaat 
23881 ttgaatgtct ttgttcttga ccccgtcaat 
23941 aaagtacctt gtgtattttc atatttacat 
24001 gtttcctttt ttcggcgcaa caattctcgg 
24061 ttcgatagtt tcaattgtta caaggattgc 
24121 caacggcaaa aattgaacaa gtaatattca 
24181 gatttcaatt ttccaactgt aaattgagca 
24241 ccaatgatta ccatatgttt acgactataa 
24301 tatgtttgac gtgaaaaggc gggagatttc 
24361 tctctttctc tctctctatt ccaacaagcg 
24421 tctattcatc catctcctct tctgttatcg 
24481 atttgtatat aaactgtgtg tggcgcagcg 
24541 gtgatcgacc aattgaggtc aatcggagcc 
24601 cgggccgctg cgccgtgatt tttttgtttc 
24661 gtatattgta aaaatttgga gcaatatcgt 
24721 tctagattcg cttaggctta agttaggctt 
24781 cggcttaggc gtagtcttac gcacaggctt 
24841 aggcttaggc ttaggcttag tccaaggcta 
24901 gcttaggcgt aggcttaagc ttaggcttat 
24961 gcttaggctt atgcttaagt tcagtctcgg 
25021 ttaaaccaaa gaaaaccccc taaatcaatt 
25081 gatggcaact gtacatggca actctcgtaa 
25141 caacgtcaaa gctcttttta attttttctt 
25201 gttcaaattg gtagtggtct gacctaaaat 
25261 catctatcaa atggtattca tgtgtttata 
25321 taaatataaa caccgaaaac gcattgttcc 
25381 gtttggttat atgttgacca agtgtttggt 
25441 cataagagtt gagtaaaaca ttttccaatc 
25501 atattttaaa ccattggtct catttttatg 
25561 aagaaaacag tgtgcaattg ctaaaaatag 
25621 tgatctgctc aactgttgac tggtctgagt 
25681 gccgcaaggc aatagttact gaattccttc 
25741 ttgcaaccat ttttttctga tgttctagat 
25801 tttttgagat caaacttgca gttcgaaaat 
25861 taatttagga attttgttat acatatcact 
25921 aaaatcttaa ctgctctcca cggaacttgt 
25981 gcctgagtcc aagcctgagc ctaagcctga 
26041 tcttaaaatg ttacaagaag gtttttcctt 
26101 gctatttaga cttagggtgc ctaactggaa 
26161 ctaagggccc gaaaaacata ctaggatgcc 
26221 gacacactta agcctaaggg cccgaaaaaa 
26281 tggaaatcct tatgacacac cggcggtatg 
26341 tcaaaataca tttgagctcg tcgtgcgttt 
26401 atctgtggat atttttcagt aatctaaatg 
26461 tttcataatg tttgggggga tgcgagcatc 
26521 tagcggcaaa ag.tccaaaac aaagctcaca 
26581 gtaacgaaga actttaacga tctaacgaag 
26641 atctccgtgg ataaaatttt tacagaaaag 
26701 tatagaacaa gtttgtagtt gaaagttttt 



aaaacaaggg cttggtgagg cgaagacagt 
ttgaaacatt tggcgatgat cagaatatta 
tttttttgtc attcttcgag ttccggaagt 
tatttttacc gtttttccct taacctgcta 
tggaaacgaa acttcgagta aatttttttg 
atttgtactt tgaaaatttt ttaaaaatac 
gatgtaggca cagaattcag catgtctggt 
cttgcctaag agcttgctta cttccagaag 
gagtttctga caaactaatt tgaaagagaa 
gtacctaggc aggtaggtag gtaggtaggt 
caatattgcg cattatgcca attaaaataa 
aggaaacttc cagaaattca attgaataca 
gggaaaaaaa tcaaatgttc aaaaactaga 
tctaaaataa .cgtaaacaac gattacttat 
caaaaggaaa ctatttggcg ttaaagggaa 
tcaaaaatca ttcaatgaga agctgtagag 
atccgtccta actttttcca gagtgggttt 
cgaaatttca aacttcccaa attgggaata 
gagagctgga ttccgtgtgc tccatctgga 
aggaggaata agggcggagt gtgaacggtt 
ttcttcttct cattttctgc ctctcatcca 
tcgtcgtcgt gtatgtgaga aagtgcgtct 
cttcggccct ccccggccag ccgtgcgcaa 
acaaatagaa tctatgtcgg gggctcgtgg 
togcatttta gtctaccatt ctatcgttag 
ttttttaaat tttaaacgtt ctcattttca 
aggcttatgc ataggcttag gcttagtatc 
agacgcaggc tcaggcttag gcttaggctt 
ggctttgctt agacttaggc ctagtctctg 
gcttaagttc aggcttaggc gtaggcgtat 
cgtaggctta tgttcattct tactagggtt 
tgtgaccgaa actaaatcga acttcaagtc 
cttccgttat ttaacttcag tgtattttct 
ggacactcca acgtcacgac cagtataaaa 
tgatctaaat ttagtttcat agaattttct 
actatgaaat tgtttccccc ggtggaaata 
aaaaatagat gggctgtgtg gtctaaaatg 
caatatcaga atgtatatat ctatacacat 
aattccaatc tcaagccgat ttaattttag 
caaccaaacg gttcaccgga cgagttgaaa 
ctccgaagtg aaacgtgggt gtccctaaat 
caatatatct gtgtgaagca tgcatgcgaa 
taggtcatcg aacaggtttt tattacacat 
ctgctacgaa cttggaatgc aattggttca 
accttttcat tttgtttctg tagctcgtca 
gtaattaaaa agattcttgc tattttctaa 
gtatgagcct aagcctaagc attagcacaa 
gcctgatcct aatcctaaac ctaagcctaa 
gcgcttggag cgcaaaagaa aagaaaaaga 
taaaacattg gaaatcccaa tccaaccagc 
caacttgaat aaaatattgg aaatccttat 
aaactaggat gcccaactgg aataaaatat 
gcgcggctta agcctaaata gccactttta 
tacaataact tctcaggcaa ctcaaaagta 
aagactatag attactaaga aacttggaga 
tattggagat ttgccatttg atagaacttt 
gtgggctctc aaagatcatg aaatagcact 
caattttcca aattcacttt ggcagctcat 
tcatcaactg ataagttgtt gatattgttg 
tttaccaaaa aatttttgtt tgagagaaaa 



26761 gcattagaaa cggaatagcg tcataaaaat aacaacagca gttgccgcac ttcatgcggt 
26821 tttatctcaa acaaaagcgg agatatgagc taccaaagtg aaattggaaa attggccctc 
26881 caatgcttcg ttaaagtgct attttcagga tctttgagag cccgccggga gcttggttct 
26941 ggagttatat tgatcaaatt aaccctccag taaaggggga cctttgatga atataatcac 
27001 tctgatggta tttaattccg atgagtaatc catttttctt tttctcacat ttgtgaacca 
27061 aaaataagtt ttaaataaag gcgggatatt ctaaggtgtg ataacatatg atatttattt 
27121 ttaaatttaa ataaagtttt tttttatttt tgctaaaaaa cgaatagttt acaaccgcct 
27181 cgctcaaatg tattttgata aaagtggcta tttaggctta agccgcgcca taccgccggt 
27241 gtgtcataag gatttccaat attttattcc .agttgggcat cctagtatgt tttttcgggc 
27301 ctttaggctc aagtgtgcca taaggatttc caatatttta ttccagttgg gcatcctagt 
27361 tttttttttc. gggcccttag gcttaagtgt gtcataagga tttccaatat tttattcaag 
27421 ttgggcatcc tagtatgttt ttcgggccct taggctggtt ggattgggat ttccaatgtt 
27481 ttattccagt tgggcaccct aagtctaaat . agctcttttt cttttctttt gcgctccaag 
27541 cgcaaggaaa aaccttcttg taacttttta agagggtttc atatatttta ttaaaatcgg 
27601 ggcgaagccc tgattttaaa tccatattgt ttttttttgt cttccactat ccctgcaaat 
27661 aggaaagaga atgtgttgtt ■ tctgatgaag taaaaatcat cataaaatct tgaaaactga 
27721 gagcaggagg taatatttga atatattggg ttggaaatgt gtgtctccct gtgggtgggg 
27781 tggcgatgtg ttggcatcca acccttcaac gaactgtatc tcccgcctgt atctcccttc 
27841 aaagtaagaa ttgggttaca aaaatttggg ggaatatgaa aataaatgta aggatttcaa 
27901 aaatgttaat tttgaaacac cagaccaaac cactttttct gggcaagagg caaaaaatta 
27961 attttttgaa aaaatttcaa actggcacaa aattttttca aaaataaatt ttcacaaatt 
28021 tttggtagat ttcacttttt ataaatattg ttcattgaac acagaaaaga aaacgaagat 
28081 tcatcaaaaa atgagtgaaa aatcgcaaaa attcgaaaaa gtccgtgctg aaaaactcga 
28141 ttttggcggt gctgaaaaaa attgctacta aaattttttt tgaaacttag tttttcggat 
28201 ttagcgtcaa attttgaatc tatgtaaaaa acagaattaa aattgatctc agattgagtg 
28261 agtaataaac gctcaaagtt gaaaaatgaa caacgcaaaa acggcagtaa cttgcttcaa 
28321 agtcggttat ctcagtgagt tttcactcaa ttttcgaaat ttttttgctc tatcgcttta 
28381 gaaatatttg taatttcatt ttttttcctc aaaatcaaaa tatctcaaac gaccgccatc 
28441 ctacgagaag ggaaaaaaaa aagtttttgg aaaaaaaatc aaaaattttt tttctgcctc 
28501 gattttcaaa atgaaaaaat cactttttcg gaataatttt tactttaaaa tgtacctacg 
28561 atctttttat acaacaattt caccttggaa ccaaagattt ctcaaagaca acagcgcacg 
28621 aagcggtggt ttgcgaaaaa acacgaaaaa aacacaccga agtgggaggg ggaggtcaaa 
28681 attgcgaatt tgactttgga gctcactaaa agagctcata gctgatggat ctggatgatt 
28741 tgtatgtttt tcgacgcatc tcgtcgagta ctatatactg ctaaaaaatc tcgtaaaaat 
28801 cgcaaaaact cgaattttcc tgcaaaaaaa aagaggcgaa aaatgagaaa aaaaacgtga 
28861 tttttggagg atactcgaaa atactcttca tatctcggct caaagagctc caatctttct 
28921 gaaataaagc atacagaggt ggggcaagct tcttgaagag acagcaaaaa accgcatcaa 
28981 aatccatcca cccaccgtca agttacacgc gcgttttcaa agtgaacaaa aaacagggca 
29041 taaaaaaatt tagacgcccg cacccggact cgaacttttt ttatattttg aagcttaata 
29101 tctcggctcc tgtacatcga aattggctca aaattcacaa gaaaccttat cctgctacga 
29161 tcctcctgtc attaaatttt cgtgagctta gaccgaaaac tgacaaaacg ccaaactttc 
29221 aaattttaaa aaatagttga gggtcatttt gaaaacacct taagctccat aactcggcga 
29281 gttttgatgc aatcaatttg aaattttaca cggaagcttc ttagtggcct atacgtctct 
29341 gtgttaaatt tcgctttgat cggccaacgg gaaccctttg aaaaatgaaa aaaacgttat 
29401 aaacccaagc ctaaatagag ctaacgctcc cccactgacg ccaagcctaa gtctgagcct 
29461 attcctagta taagtctggc ctgaacctaa gcagaaatac cttccttcgt aagttcccgc 
29521 caaacatatc cctttctcac tttcattttc caaaaacttt ctcccattcc gcactcttcc 
29581 tccggctcga ccgtccgctc cgccttctca gtctctctgt tcccttttct ccatctttcc 
29641 gaatatccga ctcacttaca caaacaaaca aacaaaagtc ctttgcgaaa agcacatata 
29701 ccgtcccctc atacatacac tacaagagag ggggcccctc cgttttccct attgattccc 
29761 cttctctcgc tggtcactca ctctcttatt tctccccctt tttccatttt tatattcacg 
29821 attcaatgtg tcatctaaac cttctactag ttctgaacta ttttttctca gttttttttc 
29881 tattccggaa tgtttgtcct ttgaagtgca ccacgatgac cgtaaccact tttgtacact 
29941 atttcataat tttggagtct caaagttgca tctaggtttt ggtggttttg agcggagggg 
30001 cacgccattt ttttggaatc tagttcttat tgttagaagg gtaagaattg tgatcttgta 
30061 gacccaaaat attatgacct caaaaaccag tcatttcact tatttgaaaa ccaggggcga 
30121 aacattttgt cttctactat aaaattatct ctgattatat ctactgcttc aactcatcct 
30181 cagacctacg tgcctacgtg ccatgtttct tgtatgattt cttaatctca ctgaattttt 
30241 ttcctatcat tcggccgtaa gatctctgcc aaaaacttta gcaactttag gaacttttaa 
30301 accaaatttt gccagttttc tgtaatttta agtgagaatg tttgtaaaac caatttgtgc 



30361 aactagcatt tctctagatc ctagaagcct 

30421 agatcaaact agctagtaag atctctcgat 

30481 aaccatggtt tgtgccttct ttccaggcta 

30541 tcaaatcaat tcacccaatt tattcaatct 

30601 agaagcatca aatgttgatc tttcgaaaaa 

30661 aagatccgtc tcagaatgac aaatgattcc 

30721 ggaaacgttt tctttactta tttccttagg 

30781 aagttcctga atcaacagga accaagcagt 

30841 ttataatgtg gctgaaaatt ggtaactatt 

30901 gacagttatt gacagtctat ctttctatct 

30961 gtatacagta gcccgtacta tctctctacc 

31021 aaacccttta aaaaaacaaa aaaaccctaa 

31081 actacagtac ccctgcagta ctacagtgct 

31141 aacccactta gtatagaaac ttaaattcca 

31201 tttccagtta aattggccaa agcggaaact 

31261 ccgttttctc tagttttatt tgaaaaaaac 

31321 ttatcactca tctcatgacc ttccatcccc 

31381 aacacaaatc aaaactgtac tcttcctccc 

31441 ctggcgcgtg taaatggtag caaaagctgg 

31501 ctcgttgcat tgttgtcaat cactcacttt 

31561 agatggacga actaatcacc ctggaaggtg 

31621 atgatttttg caacttttcc aatatcaccc 

31681 ttgtctggtg gagcaacgtg gcagttttac 

31741 atctacttaa tatggcggtg ctgacctcga 

31801 accttttaat agctctcgca gtatgtgata 

31861 taactccatt atctattcca tcactggcat 

31921 gaattgtgct ttatattcgg acattggcat 

31981 ttttagtttt ttgaaatttt attttatttt 

32041 tggcaaactg ccaaattttg gaaaaaatta 

32101 tttgattatt tcatacatag ttgttcaatt 

32161 ttcaaaatat aatcgaacga ctcaaatcca 

32221 ttcgctaata ctttttttca attttgcaag 

32281 gaacgttacc tgtgcgtcgt ttgtccgcta 

32341 tcgaaaaatg cgattgccac tgcgtaagtg 

32401 aatctataag aaataactgg gtcaacagtt 

32461 tgtgtatttt gaaattccag ggagttgaag 

32521 tccaattgtt ggaatatttg gaaatttaca 

32581 aaactgcaca tttgtagtgt tttgttttgt 

32641 aaaatgtgtg cataacaaaa cttggcataa 

32701 cctgtgctcc attcaatggc cattggccta 

32761 aaatcaatac tactacgtga ttctaatgtc 

32821 gttcttcttt cctacactct ttttttttcg 

32881 acttgatatc cgccactgag ttttgttgta 

32941 aaggtaaatt agaaagatgg ggaggtagga 

33001 gagcaaatcg gatttgcaca aacacattga 

33061 gtttgtgact ggaatgacta attgtttgtt 

33121 tggcgggaga atagaggtca aaaagatgat 

33181 ggaaatttat ctagggacct ttgatgggac 

33241 ctccttttag aaattaatat agtaatgtag 

33301 tacttttact atatgatttt ggaacattgc 

33361 tgtgcgatca gaacattgtg actttgaggc 

33421 taaaagtaaa gttattttca gaaagttcaa 

33481 aaaaagaaaa cttcgacaac catgagaaaa 

33541 gatctcgaca atttttgaaa aaaattttca 

33601 tttttcaaat caaatttatt cataaatttc 

33661 tggactatgt gtctttattc gctttcaatg 

33721 attctcgaat aatcttcact cttcgcagag 

33781 aaaccaaact ttccgatggt ttgattcagc 

33841 caaatgcaat gcttttcgca gtcgtcttca 

33901 ctgcaagaat tctgttcgat atgtatggac 



agatcctaaa gttggtcgag aagttgcgta 
caaagaactc cctatccgtt tcctacctaa 
cctatacatt ttgatctatt tcctcatacg 
tgacgaaatt tttctgtcag ttgatctata 
ttgaaaatac tatgaggaat ataaagtcta 
gtttttgcac aaacatcgtc tgaccttggc 
atactttctc atgttaaaaa aattggtata 
atgggtggtc tggctttgac cactctggct 
tggttgatca aaacatttgt cgcacactct 
ggtctacagt aaccctacag tatccctaga 
aaaaccaact gttcctcaat tgaaatacaa 
caaaaaccca tcctacagca ctcatacggt 
cccacattac tcctacagta tccctaaact 
tgtattttgc tagttgacaa ttttttgatg 
atacttcctt ttccaaattc tgcaaatttg 
ccaatattta attagattac ttctgcaaca 
tcctttttca tgttcacctt gagaactcgt 
cctcccgccc gcgatcactt ttctcattgt 
ttgtaatcat aattacactt tgacatttca 
ttccattttt ctcgctgacc tcctcattgc 
cttctcaatc tgaacaaatc attggatccc 
atcatgagca cgatgagcaa tcgatttcga 
ccgtgattgc tctgatagga ttggcttgta 
ataaaacagc acgccgaatc ccttcgtgga 
gcttattttt gatatttgct actctggatg 
tttccacttc tttcaatcat ttttattcaa 
caactttcta caaaagcagt gttttgtaag 
aaggtcgtaa tattccaaaa agtaagtttt 
ggcatgtata taggcatgta tgacctattt 
taagattttg aagctatttt taacc.ttcat 
tttccacata gcatttaaat tatttcacat 
tattccagaa ttgttgtggc cttcaacatt 
aactcgcatc gatggtgtac ttcaagaaac 
ggggttctat ttttagcggg attatttgta 
ttggttaatt gaagttatga ttctaaattg 
atccaggagt tatggtagtt acgcagaaag 
aaaactttaa attaaaattt caaaaatcta 
caaaaaaatc ttccttccag ttttaagttt 
ctcgattaat tccagaattg tgctctcatt 
tgacacaatt cgttgtttcg aaagcaattc 
gacaaaccgt gctcttcagg tatttaccat 
tttccaatgg cattgtggcg caaagtagta 
attagaaaag cttttactta ctgtttggag 
atctttgaaa tttattttta gaatcaaaat 
gaagtaaaca taacacccga ttagttgaca 
tctatagtaa ttcgctgagc aggggaaaaa 
aggtaattat gagtcagtta ggttttaggt 
ttgcaacata agaactaatg tatcgttttg 
agtaatattt taactttctg aaaacaatgt 
tcaaaaaatg ccattaaagt tgcaatgtga 
agtagttgag caatttttga aaatgttcat 
atattgttct atatttccat atacgttttt 
aaggaatttt gtgaaacaag ctggaaatta 
attcaaaaat tctagatttt atattttgta 
caattttttt ccagatttac taccgaacaa 
tcctcccaat tattggtctt ctctacatga 
ttgttgatga agattcaaga aaatacgaag 
atgatgctca taacaaccga actatgagag 
tgcttttctt ctgtgttgga cctcaggctc 
agtatcatcc aaaagccata ctttatgtat 



33961 gtttgagcca acaactggta ttcctgaatg 
34021 tgtcaaagag atatcgaacg ttgatgaaac 
34081 aaggtgagca acattttgaa attttgtcga 
34141 ttcctggaaa tgtaagaaga aatatagaaa 
34201 caaatttaaa ttttaaaaaa ttgctcaaaa 
34261 atatgccatt tgatatatcg atatttgatt 
34321 tataaaagtg aaactatttt tagaaagtac 
34381 ccaaaaaaat acttaaacat aacgaagaga 
34441 ttcctacttc atgctttaac taaagttgaa 
34501 attccaaatc aatctgaaac aaacaaagag 
34561 tcatcatgct cacctcattc aaaatgtcta 
34 621 cgggttctct ccctcattcc tagcattttg 
34681 ctttatatgt acttttattc cacgggtttt 
34741 atgatcttcg gagacagaga ataaatcggt 
34801 atggtgatga cgtgaagggg aaaggtgtgt 
34861 caatagagaa gcttgattgg gggtcatcgg 
34921 acaatgacga aaaatgttca aagctcagtg 
34981 ccaaattact tatgcacatc tttcaattaa 
35041 tatttgatag ttttattttc acaaaaaatg 
35101 tatcttttaa aaattaaaaa aaaaactaaa 
35161 actgtcaaaa aatggcaacc actcaatttt 
35221 atattacaaa ttttttttac tgattttagg 
35281 ctttgaaagt ccaatttttt aatattccaa 
35341 tttttaattt ttttggtaac acatgtagtt 
35401 agattacttt aataattact gcacaatctc 
35461 ctttttgaat tttcaagcct tgttgtttca 
35521 taatcttaaa gtaattttaa actcacatca 
35581 tgttgttttt catgctgtac ggcatgcaaa 
35641 ttcaccagta gatattatca atttcttttt 
35701 taaaaccata gttttttttt caatttattc 
35761 acttcatagc ttcatgtagg cgtaatagct 
35821 ccgactctgc cctcccacat ctcaacgtct 
35881 cgccttctct gtcgtcatcg acgcgaatct 
35941 ctaaatatat catatcttag ctatgaactc 
36001 ttctttgcaa tttcaagcgt cattttccca 
36061 cttgtttggt ggatattgta ttgtaaagaa 
36121 gtatttgctg catttacatg cttggtatgt 
36181 tgcagaccgg caattctaca aaataatcat 
36241 agaaaaatat ttgttttcct tatctgatag 
36301 aaagtgctcg aaattgcttg aaaaatactg 
36361 tcaactttta gataattttt gagcgttttt 
36421 ttcagataat tcaaatattt ttctagattc 
36481 aaaattttga gagaaaatgt cccactggat 
36541 cagaaaaagt gtaatgaaaa ctttggaggt 
36601 ttaaataaat gctccaggtt acttgaataa 
36661 tttccgacgc tacttggata actctataag 
36721 ttttaaagcc tggctccaaa ctcagacatt 
36781 tctcattttt caggacaatc ctcacagact 
36841 tgtttttccc ggtactggtc ggtatttccg 
36901 tgtggattca ggtgttttgt gcacaacttg 
36961 agagatgaca acaataaaca aatagagttc 
37021 cttgttagag ggtttttgag caaattgtta 
37081 gaaaattttt ccctacgtca taaatctatc 
37141 gagagcaaga aattattgaa gcgtgcctaa 
37201 ggacgcttca aatagtttct agtctctaat 
37261 ttaagctaat taaaaattcg cgtgactgag 
37321 attttgaagt gttgtagcac aaatgagatg 
37381 ttgagcattt cagtcataaa acacattaac 
37441 ttcaaatttg tttttctcaa cgttgtatcg 
37501 tatagaaaaa attttgaatt ttttgaaaat 



cgtcactcaa tttctgtttg tattgtgttg 
agactttgaa gaagttcttg cacaaacttg 
gctcgctcgg catggtgtgt tgaagttttt 
aatatttgaa ttatttgaaa atgacttttt 
attgactcaa agtcgcaatg tgtcgagtga 
taattcttga gcaaattttt acaaatttca 
aagtattttt ctatatttct atataaattt 
aaaaagggcc tgaaaacggc cttacataat 
tatattccaa ttttcaggag tcgaccatcc 
cagctcggca catgtgacat ctcttgagga 
gatcatactt gcttactcaa atcagataca 
cacatacata tatgtattac taggttcatg 
accttgattc cagtccttgt aattagaatg 
attgtttttt gatttcttga tgattggtga 
tttggatgga atttgacgat tttcagacgt 
ggaattatag gcgctaatta ttgaatgaaa 
atctatagcg gaaattagtt aagagacttg 
tctgagtggt tgtttacact tggagaactg 
tgtcaaagcg cataaactat atgcagcaaa 
tattaaacaa aaaagttaac atttttattc 
cgactatcaa aaaaaagaat cgaaaagttt 
ggatcaaaca tgcaagaatg ctcgaaactt 
tttagcaaaa ctgacagtaa atattgaaca 
gtccagttaa tttgaaactt gccaatttca 
tactagaatc gattttttga tctaagaagt 
acagtttgta acgtcaacta aaaatatgta' 
atagacttga acttaccaag gtttccgtac 
attatttatt tgaatacagt taattgcaat 
ttttccaaaa acatcaagct cgaaaaatga 
gaaattttct cagtaaaaga ctgtcccatg 
tcaattcata ataaaaattg ttctagatca 
cccgatttga aaatttcttt gctcttcgct 
aaccagatat cacttcgtcc tttcgtaact 
gtcttgcaca attatattca cttattggga 
tatcataatt tgtatcactt ttccaattaa 
aacaccgagg aatcgaagaa gtgccagatg 
agactactta ggcttagttg attccaggga 
gttcttcttt aaagtttagc gatgggattt 
tcatttttat tttaattgaa acaattgaaa 
ataatctgtc ggtttttttt tcaacaattt 
tagatagctc caattaaact aaaaattatt 
aaatattttt tctagactaa actttaataa 
cattttgtag aatcgcagta agctgacaaa 
tcaaaaaaat aaatgttttg aaatatatat 
ggcaaaaagt atttatatac acttttaaat 
ttttcaatat tattttgtaa aacttatagc 
catagctttc ccccatctcc aaaatttttc 
tgctcatcgg tggtctcgtt tgtccataca 
ccggcgtgct cagttggttt tcagttcccg 
gattgggtgg tcagttttgt tttttttggg 
gcccacgggg accgggggac gaggaaaaaa 
tggatatcag tgagactttg agggatttgg 
tgcaacatct attcttagtt aagaataatc 
caacttcaca gaaattttat gtttgtttag 
gattttttgt gatttctgca aattaaactt 
ctgttctgaa aatctgaatt tgaaattcta 
agcacatcaa gcatttcatc tttcatttca 
actgcactga actcgccaca aaaattaaat 
acgttttttt tttgggaatt tgcataaaga 
aacggtactt ttaactggaa atcttcgaaa 



37561 aattgctcaa 
37621 gaaatttgag 
37681 tcggaaagta 
37741 tgtcgattag 
37801 gatgctctaa 
37861 aaaaacctca 
37921 atcgccacaa 
37981 cattctacgc 
38041 tacctgatcc 
38101 ttattgaatt 
38161 caatcgaatg 
38221 tttttgctat 
38281 atgagcttaa 
38341 gatatgattt 
38401 catcaatcaa 
38461 acattttcag 
38521 ttcaattgtt 
38581 ataaaattgt 
38641 acttccccca 
38701 ctgccgaccc 
38761 cgactccaaa 
38821 gatctctcac 
38881 tctctacacg 
38941 tttacacttt 
39001 atgaccgtgt 
39061 ggtgataata 
39121 ggcttcttgt 
39181 gcgaattact 
39241 cctattttct 
39301 catgtgtagc 
39361 accgatttac 
39421 cggatcgtgc 
39481 gaaagcccaa 
39541 gcaatccttg 
39601 gctagggcca 
39661 tcggacaagc 
39721 tcggcgaaaa 
39781 agcccaattg 
39841 ctcaagaccc 
39901 aagcgttcgc 
39961 gtgcgcattt 
40021 gctattttta 
40081 aaaaacaaaa 
40141 ccttatgata 
40201 ttgaacgtgt 
40261 tgttcagcca 
40321 gtgcaatccg 
40381 tgaaatgaaa 
40441 aactacaatg 
40501 gagagaggtg 
40561 cacggaatgt 
40621 agccaattcg 
40681 aactttctcg 
40741 aaattaaaat 
40801 ttattaccca 
40861 ataaactttt 
40921 ttaagtgttt 
40981 ctatgaaaaa 
41041 ttattgaagg 
41101 gtaattttga 



aaacctactc 
tcagttttta 
cacatctttt 
aaaagtgggg 
gaattacaga 
attttctgaa 
tacgctggtg 
cgtcaacgga 
agaatgggca 
tcaatgttcc 
tttttgtttt 
tttttctgac 
aaacacacat 
tccaagtgag 
caatgtttgc 
gtttcaatgc 
tggcagcagt 
attctcccgg 
acgtggggcg 
gctttgcatg 
tgcctaccca 
ataacatgta 
gaatgtgctc 
ccgttttgac 
cccccgccac 
aaaaaaattt 
ttcacttcaa 
ttcaaatgaa 
cgtaatcaat 
ttttttgcga 
agacaatgac 
tccgaaacac 
agcttgagcg 
aaacaccgtc 
tctacccgta 
cacatcatct 
cggtaaaaaa 
atttctagtc 
attatgcgcg 
agaaacgagc 
tgtttggccg 
tttgagttct 
ctctcgattt 
taaaattttt 
agaaaatgat 
agaaatcatg 
aagacaaagg 
acttatgtat 
atattctgaa 
ttgagggggt 
gttatattat 
ttttggtaga 
cttcttcttt 
atgaagaaga 
gacgaaagac 
gtttaatgca 
tcattgtgtc 
tcaatgcaat 
ggcacaccta 
cggtaatatt 



aaacatcaac 
ggcagcttct 
tctatatcat 
ttttaaggga 
ctaaccatgt 
attccaggtc 
cgcgggagat 
atctatgcgc 
aaactgattc 
tatatttcag 
ttcaatggat 
agaagtattc 
gtcaccaacc 
cttttacttc 
cacgccttcc 
ctgtcgttgt 
ataatcagag 
ccacacatca 
taactttttt 
cctacaatgg 
tacctatatc 
catacgttca 
aaccatcaca 
gtgtttctat 
cgttacacaa 
ttaagttgaa 
caactcggta 
ggttccgcaa 
agcgggaaat 
aaatattttc 
tcattctgtg 
ctcgatttac 
ttacgaatgc 
caaagccgct 
caatcgcaaa 
cgtccagtcg 
agtttaacgc 
aatctcctta 
tgtttgcgaa 
ataatgcctc 
ttattcccat 
ctgtttattg 
agaaggaact 
gatgaaattt 
cagtttgata 
gaaagtctag 
tcgagacgtt 
cgaataaaaa 
atatacaaga 
gaaagatacg 
tagataacag 
ccatgatcca 
gctttctcca 
aatgctaatc 
tgtttttttc 
aggaagtgtg 
atttgagttt 
tttttttcag 
tttgcattta 
aataatatgt 



gtgccgagtg 
tgaaaatttt 
caaataactt 
caaattttga 
gattaacaga 
tcgccacggc 
ttaaaattga 
ttcttttttt 
ttctaaaagt 
attcttccgt 
atatcgtcag 
gtactttgca 
accagaaaat 
tgttttcttt 
gtgtatcgct 
catcgttttg 
taagtcatgg 
caagctggga 
tgggcgagta 
gcagtaggcg 
cctgccatac 
gtgttcaaca 
atgatctgcc 
tatgatccaa 
caaaaacatt 
acttaattta 
atttttaatt 
tgatggttgt 
tactagggtt 
gtggcgagaa 
gtagttcgtg 
gacgtggcag 
ctggttctat 
ctgctcagga 
aacattcgcc 
ctcaattttg 
agtcagcagg 
cacaggtgat 
cgaggacctc 
gccgatgggc 
cattaaagcg 
actgtttttt 
tgtcagattt 
ttttgttttg 
tttttcacaa 
agttgggtca 
ggcacatggg 
tgtatctgaa 
aattcaaaaa 
gcgggttaac 
aggtttcagt 
ttgaaggctt 
ttgcagaaga 
agaaatgtga 
tctagaaata 
cgcctttaag 
tcaagcaatt 
tagcaacaag 
acaaaaaatg 
. ttacctactg 



aatgtgccaa ttgacacatt 
aattggaatt taggttttta 
ttccaaaaat aacctagaca 
aagtaggttt caaactttca 
ataaaaaatg ctattttcca 
tatcgtcatg cttttcgaaa 
agccaactgg atccgataca 
actaccagtt tacttcagaa 
agtctgtaga cttttataac 
gccctcatcc gtcatatttt 
ttactgggac ggctggaata 
tccattcaag ttattttctg 
tacaacaaaa attctttatc 
aatttgaaag catttgcgcg 
ggttggtcgt atgaaaatta 
cccatgatct attgttttta 
tgcatccgac cagctaaacc 
ttgctcccaa aatataagag 
gcgagcaaca gcctacctgc 
actaactggc gagtagtagg 
gtgcctgcat aatatcccca 
acattgcgat aatgtgcata 
tgaatccaac atatcgtcat 
ctcttcgaag aagctccgtg 
gaatttttta aacgtttttt 
tttctcgaca attttcatac 
tctatttagt cttgtatgtt 
gcctttaagt agtaaagtaa 
actgtactac tttgaaggca 
atttgagtct ttcaaagttg 
aaaatgtgaa atccgaatat 
tgaagattgt gaatccgaac 
ccaataagta cacggaagtc 
acaagccggt cgagatctac 
attggattga aggacacgtg 
tcgccgacaa tgcagagtat 
attcgaacct acgcgggcag 



tcgccacgga cgcataatgc 
acattatgtg cgtttttgcg 
acaatgcgaa cgctcgcaat 
ctcatttcag ttagtttaaa 
t get gat act aaagtttctg 
cgaaaatcat aagggtccac 
aaaatttttt cgaaaacatg 
gtgtgagagt tcacaatgaa 
tttccgttgt tcaaggcatt 
gagacaaagt ctaaatctag 
tttacataca tcaatgagat 
ataaacacag ggagagagga 
gaatcaaaaa tatttgtgta 
agtgtttgac tgcgtggaag 
ettegtagat tecagegaaa 
gggttttctg aaaaaaaaac 
t act tt teat caaaacgaat 
eggtagcegg tctcgccaca 
gggtactgta gtttaaaaat 
aatttttttt aatcgcaaaa 
ttttcgaact acagtactct 
tegtttcgag gccgggtacc 
attttattag ttttttattg 
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1146 aa linear INV 23-OCT-2003 

C^elegans AGE-1 protein (corresponding sequence B0334.8) 
[Caenorhabditis elegans] . 
CAA91377 

CAA91377.2 GI:6018364 

embl locus CEY62F5A, accession AL110499.1 
embl locus CEB0334, accession Z66519.2 

Caenorhabditis elegans 

^ RQp ^r habditis elegans j . 4- ■ ^ 

i^XIT^ta; Metazoa; Nematoda; Chromadorea; Rhabditida; 
Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. 

1 

Genome sequence of the nematode C. elegans: a platform for 

investigating biology. The C. elegans Sequencing Consortium 

Science 282 (5396), 2012-2018 (1998) 

990696 13 

985191 6 

The C. elegans Sequencing Consortium, 
2 (residues 1 to 1146) 
Swinburne, J. 

Direct Submission t 

submitted (27-OCT-1995) Nematode Sequencing Project, banger 

institute, Hinxton, Cambridge CB10 ISA, England and Department of 

Generics, Washington University, St. Louis, MO 63110, USA. E-mail: 

jes@sanger.ac.uk or rw@nematode.wustl.edu • 387374R 

on Oct 11, 1999 this sequence version replaced gi. ib 1 I* '4 8. 

rndina sequences below are predicted from computer analysis, using 

predictions from Genefinder (P. Green, U. Washington), and other 

available information. 

Current sequence finishing criteria for the C. elegans genome 
seauencing consortium are that all bases are either sequenced 
unambiguously on both strands, or on a single strand with, both a 
dye primer and dye terminator reaction, from distinct subclones. 
FxreDtions are indicated by an explicit note. 

Sis sequence is the entire insert of clone B0334. The true right 
end of clone W02B12 is at 4181 in this sequence. The start of this 
, PGUen ce (1..104) overlaps with the end of sequence Z66521. 
The end of this sequence (41657 41812) overlaps with the start of 
sequence AL1104 99. 

For a graphical representation of this sequence and its analysis 
see: _ http://worrtibase.sanger.ac.uk/per l /ace/eleqans/seq/seauence? 

name=B0334 *. • . * 

IMPORTANT: This sequence is NOT necessarily the entire insert of 

th e specified clone. It may be shorter because we only sequence 

overlapping sections once, or longer because we arrange for a .small 
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FEATURES 

source 



Protein 



CDS 



ORIGIN 



overlap between neighbouring submissions. 

[020104 ell] Sequence correction based on Thierry-Mieg EST analysis. 
Location/Qualifiers 
1. .1146 

/organism="Caenorhabditis elegans" 
/strain="Bristol N2" 
/db_xref= fl taxon: 6239" 
/chromosome^ " II" 
/clone="B0334" 
1.-1146 

/product="C. elegans AGE-1 protein (corresponding sequence 
B0334.8) " 
1..1146 
/gene="age-l" 
/standard_name="B0334.8" 

/coded__by="join (complement (AL110499. 1:6114. . 6178) , 
. complement (AL110499. 1:5742. .6027) , 

complement (AL1104 99. 1:5129. .5689) , 

complement (AL1104 99. 1:3804. .5080) , 

complement (AL1104 99. 1:2872. .3132) , 

complement (AL110499. 1:2446. .2820) , 

complement (AL1104 99. 1:350. .700) , 

complement (Z66519. 2:41436. .41568) , 

complement (Z66519. 2: 40596. .40727) )" 

/note="contains similarity to Pfam domain: PFO0454 
(Phosphatidylinositol 3- and 4-kinases) , Score=271.3, 

E-value=4e-78, N=2; PF00613 ( Phosphoinositide 3-kinase 

family, accessory domain (PIK domain) ) , Score=41.6, 

E-value=6.8e-10, N=l; PF00792 (C2 domain) , Score=96.6, 

E-value=1.6e-25, N=l; PF00794 (Pl3-kinase family, 

ras-binding domain), Score=141.9, E-value=3 . 7e-39, N=l; 

PF02192 (Pl3-kinase family, p85-binding domain), 

Score=172.3, E-value=2 . 6e-48 , N=l 

cDNA EST CEMSC28F comes from this gene 

cDNA EST ykl52c9.3 comes from this gene 

cDNA EST ykl52c9.5 comes from this gene 

cDNA EST U56101 comes from this gene 
cDNA EST ykl92d4.3 comes from this gene 
cDNA EST yk260gl0. 3 comes from this gene 
cDNA EST yk302al2.3 comes from this gene 
cDNA EST yk355b7.3 comes from this gene 
cDNA EST yk360dll.3 comes from this gene 
cDNA EST ykl92d4.5 comes from this gene 
cDNA EST yk260gl0.5 comes from this gene 
cDNA EST yk302al2.5 comes from this gene 
cDNA EST yk355b7.5 comes from this gene 
cDNA EST yk360dll.5 comes from this gene 
cDNA EST yk543gl0.3 comes from this gene 
cDNA EST yk543gl0.5 comes from this gene 
cDNA EST yk601d4.5 comes from this gene 
cDNA EST yk847g08.5 comes from this gene 
cDNA EST ykll26g04.5 comes from this gene" 
/db_xref ="GOA : Q94 125 " 
/db_xref= n SWISS-PROT:Q94125 M 

1 mhvnilhpql qtmveqwqmr erpsleteng kgsllleneg vadiitmcpf gevisvvfpw 
61 flanvrtsle iklsdfkhql feliapmkwg tysvkpqdyv frqlnnfgei evifnddqpl 
121 sklelhgtfp mlflyqpdgi nrdkelmsdi shclgysldk leesldeelr qfraslwart 
181 kktcltrgle gtshyafpee qylcvgescp kdleskvkaa klsyqmfwrk rkaeingvee 
241 kmmkiqiefn pnetpksllh tflyemrkld vydtddpade gwflqlagrt tfvtnpdvkl 
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301_tsydgvrsel esyrcpgfvv rrqslvlkdy 
361 tpkqsknsdm vmtdf rptas lkqvslwdld 
421 vgtltlasks ttkvnaqfak wnkemytfdl 
481 wvnmsltdwr delrqgqflf hlwapeptan 
541 psqgqytylv khrstwtetl nimgddyesc 
601 wmwrryiqkq epdllivlse lafvwtdren 
661 virkfavekl neqlspvtfh lfilpliqal 
721 llraeiarlr dcdlkseeyr risllmeayl 
781 pkdvatmklr delrsishkm enmdspldpv 
841 ksdlhlpfca mifkngddlr qdmlvlqvle 
901 evvpncktif eiqvgtgfmn tavrsidpsf 
961 idntqamkky fesvdrflys cvgysvatyi 
1021 gktklgiqrd rqpfiltehf mtvirsgksv 
1081 lftlmlgmel pelstkadld hlkktlfcng 
1141 havkhy 



crpkplyeph yvraherkla lc^vlsvsids 
anlmirpvni sgfdfpadvd mvvriefsvy 
ymkdmppsav lsirvlygkv kILkseefevg 
rsrigengar igtnaavtie issyggrvrm 
irdpgykklq mlvkkhesgi vZLeedeqrhv 
fselyvmlek wkppsvaaal tILlgkrctdr 
kyepraqsev gmmlltralc d^/righrlfw 
rgneehikii trqvdmvdel txistlvkgm 
yklgemiidk aivlgsakrp lmlhwknknp 
vmdniwkaan idcclnpyav lpmgemigii 
mnkwirkqcg iedekkkskk dstknpiekk 
mgikdrhsdn lmltedgkyf hidfghilgh 
dgnshelqkf ktlcveayev irxwnnrdlfvs 
eskeearkff agiyeeafng swstktnwlf 
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